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(54) Title: G-PROTEIN COUPLED RECEPTORS 

(57) Abstract: The invention provides human G-prxHein coupled receptors (GCREC) and polynucleotides which identify and en- 
code GCREC. The invention also provides expression vectors, host cells, antibodies, agonists, and antagonists. The invention also 
provides methods for diagnosing, treating, or preventing disorders associated with aberrant expression of GCREC. 
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the interaction of the activated GPCR with ion channel proteins. (See, e.g., Watson, S. and S. 
Arkinstall (1994) The G-protein Linked Receptor Facts Book , Academic Press, San Diego CA, pp. 2- 
6; Bolander, F.F. (1994) Molecular Endocrinology , Academic Press, San Diego CA, pp. 162-176; 
Baldwin, J.M. (1994) Curr Opin. Cell Biol. 6:180-190.) 
5 GPCRs include receptors for sensory signal mediators (e.g., light and olfactory stimulatory 

molecules); adenosine, y-aminobutyric acid (GAB A), hepatocyte growth factor, melanocortins, 
neuropeptide Y, opioid peptides, opsins, somatostatin, tachykinins, vasoactive intestinal polypeptide 
family, and vasopressin; biogenic amines (e.g., dopamine, epinephrine and norepinephrine, histamine, 
glutamate (metabotropic effect), acetylcholine (muscarinic effect), and serotonin); chemokines; lipid 

10 mediators of inflammation (e.g., prostaglandins and prostanoids, platelet activating factor, and 
leukotrienes); and peptide hormones (e.g., bombesin, bradykinin, calcitonin, C5a anaphylatoxin, 
endothelin, follicle-stimulating hormone (FSH), gonadotropic-releasing hormone (GnRH), 
neurokinin, and thyrotropin-releasing hormone (TRH), and oxytocin). GPCRs which act as receptors 
for stimuli that have yet to be identified are known as orphan receptors. 

15 The diversity of the GPCR family is further increased by alternative splicing. Many GPCR 

genes contain introns, and there are currently over 30 such receptors for which splice variants have 
been identified. The largest number of variations are at the protein C-terminus. N-terminal and 
cytoplasmic loop variants are also frequent, while variants in the extracellular loops or 
transmembrane domains are less common. Some receptors have more than one site at which variance 

20 can occur. The splicing variants appear to be functionally distinct, based upon observed differences 
in distribution, signaling, coupling, regulation, and ligand binding profiles (Kilpatrick, G J. et al. 
(1999) Trends Pharmacol. Sci. 20:294-301). 

GPCRs can be divided into three major subfamilies: the rhodopsin-like, secretin-like, and 
metabotropic glutamate receptor subfamilies. Members of these GPCR subfamilies share similar 

25 functions and the characteristic seven transmembrane structure, but have divergent amino acid 
sequences. The largest family consists of the rhodopsin-like GPCRs, which transmit diverse 
extracellular signals including hormones, neurotransmitters, and light. Rhodopsin is a photosensitive 
GPCR found in animal retinas. In vertebrates, rhodopsin molecules are embedded in membranous 
stacks found in photoreceptor (rod) cells. Each rhodopsin molecule responds to a photon of light by 

30 triggering a decrease in cGMP levels which leads to the closure of plasma membrane sodium 
channels. In this manner, a visual signal is converted to a neural impulse. Other rhodopsin-like 
GPCRs are directly involved in responding to neurotransmitters. These GPCRs include the receptors 
for adrenaline (adrenergic receptors), acetylcholine (muscarinic receptors), adenosine, galanin, and 
glutamate (N-methyl-D-aspartate/NMDA receptors). (Reviewed in Watson, S. and S. Arkinstall 

35 (1994) The G-Protein Linked Receptor Facts Book . Academic Press, San Diego CA, pp. 7-9, 19-22, 
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major excitatory neurotransmitter in the central nervous system. The metabotropic glutamate 
receptors modulate the activity of intracellular effectors, and are involved in long-term potentiation 
(Watson, supra , p. 130). The Ca 2+ -sensing receptor, which senses changes in the extracellular 
concentration of calcium ions, has a large extracellular domain including clusters of acidic amino 
5 acids which may be involved in calcium binding. The metabotropic glutamate receptor family also 
includes pheromone receptors, the GABA B receptors, and the taste receptors. 

Other subfamilies of GPCRs include two groups of chemoreceptor genes found in the 
nematodes Caenorhabditis elegans and Caenorhabditis briggsae , which are distantly related to the 
mammalian olfactory receptor genes. The yeast pheromone receptors STE2 and STE3, involved in 
10 the response to mating factors on the cell membrane, have their own seven-transmembrane signature, 
as do the cAMP receptors from the slime mold Dictvostelium discoideum . which are thought to 
regulate the aggregation of individual cells and control the expression of numerous developmentally- 
regulated genes. 

GPCR mutations, which may cause loss of function or constitutive activation, have been 

15 associated with numerous human diseases (Coughlin, supra) . For instance, retinitis pigmentosa may 
arise from mutations in the rhodopsin gene. Furthermore, somatic activating imitations in the 
thyrotropin receptor have been reported to cause hyperfunctioning thyroid adenomas, suggesting that 
certain GPCRs susceptible to constitutive activation may behave as protooncogenes (Parma, J. et al, 
(1993) Nature 365:649-651). GPCR receptors for the following ligands also contain mutations 

20 associated with human disease: luteinizing hormone (precocious puberty); vasopressin V 2 (X-linked 
nephrogenic diabetes); glucagon (diabetes and hypertension); calcium (hyperparathyroidism, 
hypocalcuria, hypercalcemia); parathyroid hormone (short limbed dwarfism); (^-adrenoceptor 
(obesity, non-insulin-dependent diabetes mellitus); growth hormone releasing hormone (dwarfism); 
and adrenocorticotropin (glucocorticoid deficiency) (Wilson, S. et al. (1998) Br. J. PharmocoL- 

25 1 25:1387-1392; Stadel, J.M. et al. (1997) Trends Pharmacol. Sci. 18:430-437). GPCRs are also 
involved in depression, schizophrenia, sleeplessness, hypertension, anxiety, stress, renal failure, and 
several cardiovascular disorders (Horn, F. and G. Vriend (1998) J. Mol. Med. 76:464-468). 

In addition, within the past 20 years several hundred new drugs have been recognized that are 
directed towards activating or inhibiting GPCRs. The therapeutic targets of these drugs span a wide 

30 range of diseases and disorders, including cardiovascular, gastrointestinal, and central nervous system 
disorders as well as cancer, osteoporosis and endometriosis (Wilson, supra ; Stadel, supra) . For 
example, the dopamine agonist L-dopa is used to treat Parkinson's disease, while a dopamine 
antagonist is used to treat schizophrenia and the early stages of Huntington's disease. Agonists and 
antagonists of adrenoceptors have been used for the treatment of asthma, high blood pressure, other 

35 cardiovascular disorders, and anxiety; muscarinic agonists are used in the treatment of glaucoma and 
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collectively as "GCREC" and individually as "GCREC-1," "GCREC-2," "GCREC-3," "GCREC-4," 
"GCREC-5," "GCREC-6," "GCREC-7 " and "GCREC-8." In one aspect, the invention provides an 
isolated polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-8, b) a naturally occurring polypeptide 
5 comprising an amino acid sequence at least 90% identical to an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-8, c) a biologically active fragment of a polypeptide having an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-8, and d) an immunogenic 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-8. In one alternative, the invention provides an isolated polypeptide comprising the amino 

10 acid sequence of SEQ ID NO: 1-8. 

The invention further provides an isolated polynucleotide encoding a polypeptide selected 
from the group consisting of a) a polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-8, b) a naturally occurring polypeptide comprising an amino acid 
sequence at least 90% identical to an amino acid sequence selected from the group consisting of SEQ 

15 ID NO: 1-8, c) a biologically active fragment of a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-8, and d) an immunogenic fragment of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-8. In one 
alternative, the polynucleotide encodes a polypeptide selected from the group consisting of SEQ ID 
NO: 1-8. In another alternative, the polynucleotide is selected from the group consisting of SEQ ED 

20 NO:9-16. 

Additionally, the invention provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO: 1-8, b) a naturally occurring polypeptide comprising an amino acid sequence at least 

25 90% identical to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-8, c) a 
biologically active fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-8, and d) an immunogenic fragment of a polypeptide having an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-8. In one alternative, the invention 
provides a cell transformed with the recombinant polynucleotide. In another alternative, the invention 

30 provides a transgenic organism comprising the recombinant polynucleotide. 

The invention also provides a method for producing a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO: 1-8, b) a naturally occurring polypeptide comprising an amino acid sequence at least 
90% identical to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-8, c) a 

35 biologically active fragment of a polypeptide having an amino acid sequence selected from the group 
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of a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of 
SEQ ID NO:9-16, b) a naturally occurring polynucleotide comprising a polynucleotide sequence at 
least 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:9- 
16, c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide 

5 complementary to the polynucleotide of b), and e) an RNA equivalent of a)-d). The method 
comprises a) amplifying said target polynucleotide or fragment thereof using polymerase chain 
reaction amplification, and b) detecting the presence or absence of said amplified target 
polynucleotide or fragment thereof, and, optionally, if present, the amount thereof. 

The invention further provides a composition comprising an effective amount of a 

10 polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 

sequence selected from the group consisting of SEQ ID NO; 1-8, b) a naturally occurring polypeptide 
comprising an amino acid sequence at least 90% identical to an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-8, c) a biologically active fragment of a polypeptide having an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-8, and d) an immunogenic 

15 fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-8, and a pharmaceutical^ acceptable excipient. In one embodiment, the composition 
comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 1-8. The 
invention additionally provides a method of treating a disease or condition associated with decreased 
expression of functional GCREC, comprising administering to a patient in need of such treatment the 

20 composition. 

The invention also provides a method for screening a compound for effectiveness as an 
agonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-8, b) a naturally occurring 
polypeptide comprising an amino acid sequence at least 90% identical to an amino acid sequence 

25 selected from the group consisting of SEQ ID NO: 1-8, c) a biologically active fragment of a 

polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-8, 
and d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-8. The method comprises a) exposing a sample comprising the 
polypeptide to a compound, and b) detecting agonist activity in the sample. In one alternative, the 

30 invention provides a composition comprising an agonist compound identified by the method and a 
pharmaceutically acceptable excipient. In another alternative, the invention provides a method of 
treating a disease or condition associated with decreased expression of functional GCREC, 
comprising administering to a patient in need of such treatment the composition. 

Additionally, the invention provides a method for screening a compound for effectiveness as 

35 an antagonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an 
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altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NO:9-16, the method comprising a) exposing 
a sample comprising the target polynucleotide to a compound, and b) detecting altered expression of 
the target polynucleotide. 
5 The invention further provides a method for assessing toxicity of a test compound, said 

method comprising a) treating a biological sample containing nucleic acids with the test compound; 
b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20 
contiguous nucleotides of a polynucleotide selected from the group consisting of i) a polynucleotide 
comprising a polynucleotide sequence selected from the group consisting of SEQ ID N0:9-16, ii) a 

10 naturally occurring polynucleotide comprising a polynucleotide sequence at least 90% identical to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:9-16, iii) a polynucleotide 
having a sequence complementary to i), iv) a polynucleotide complementary to the polynucleotide of 
ii), and v) an RNA equivalent of i)-iv). Hybridization occurs under conditions whereby a specific 
hybridization complex is formed between said probe and a target polynucleotide in the biological 

15 sample, said target polynucleotide selected from the group consisting of i) a polynucleotide 

comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:9-16, ii) a 
naturally occurring polynucleotide comprising a polynucleotide sequence at least 90% identical to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:9-16, iii) a polynucleotide 
complementary to the polynucleotide of i), iv) a polynucleotide complementary to the polynucleotide 

20 of ii), and v) an RNA equivalent of i)-iv). Alternatively, the target polynucleotide comprises a 
fragment of a polynucleotide sequence selected from the group consisting of i)-v) above; c) 
quantifying the amount of hybridization complex; and d) comparing the amount of hybridization 
complex in the treated biological sample with the amount of hybridization complex in an untreated 
biological sample, wherein a difference in the amount of hybridization complex in the treated 

25 biological sample is indicative of toxicity of the test compound. 

BRIEF DESCRIPTION OF THE TABLES 

Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 
sequences of the present invention. 
30 Table 2 shows the GenBank identification number and annotation of the nearest GenBank 

homolog for polypeptides of the invention. The probability score for the match between each 
polypeptide and its GenBank homolog is also shown. 

Table 3 shows structural features of polypeptide sequences of the invention, including 
predicted motifs and domains, along with the methods, algorithms, and searchable databases used for 
35 analysis of the polypeptides. 
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GCREC. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of GCREC either by directly interacting with 
GCREC or by acting on components of the biological pathway in which GCREC participates. 

An "allelic variant" is an alternative form of the gene encoding GCREC. Allelic variants may 
5 result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
Each of these types of changes may occur alone, or in combination with the others, one or more times 

10 in a given sequence. 

"Altered" nucleic acid sequences encoding GCREC include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as GCREC or 
a polypeptide with at least one functional characteristic of GCREC. Included within this definition 
are polymorphisms which may or may not be readily detectable using a particular oligonucleotide 

15 probe of the polynucleotide encoding GCREC, and improper or unexpected hybridization to allelic 
variants, with a locus other than the normal chromosomal locus for the polynucleotide sequence 
encoding GCREC. The encoded protein may also be "altered," and may contain deletions, insertions, 
or substitutions of amino acid residues which produce a silent change and result in a functionally 
equivalent GCREC. Deliberate amino acid substitutions may be made on the basis of similarity in 

20 polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the 
residues, as long as the biological or immunological activity of GCREC is retained. For example, 
negatively charged amino acids may include aspartic acid and glutamic acid, and positively charged 
amino acids may include lysine and arginine. Amino acids with uncharged polar side chains having 
similar hydrophilicity values may include: asparagine and glutamine; and serine and threonine. 

25 Amino acids with uncharged side chains having similar hydrophilicity values may include: leucine, 
isoleucine, and valine; glycine and alanine; and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 
molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally occurring 

30 protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid 

sequence to the complete native amino acid sequence associated with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well 
known in the art. 

35 The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity 
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"Complementary" describes the relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing. For example, 5-AGT-3 1 pairs with its complement, 
3*-TCA-5*. 

A "composition comprising a given polynucleotide sequence" and a "composition comprising 
5 a given amino acid sequence" refer broadly to any composition containing the given polynucleotide 
or amino acid sequence. The composition may comprise a dry formulation or an aqueous solution. 
Compositions comprising polynucleotide sequences encoding GCREC or fragments of GCREC may 
be employed as hybridization probes. The probes may be stored in freeze-dried form and may be 
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be 

10 deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl 

sulfate; SDS), and other components (e.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 
DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (Applied 
Biosystems, Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which has been 

15 assembled from one or more overlapping cDNA, EST, or genomic DNA fragments using a computer 
program for fragment assembly, such as the GELVIEW fragment assembly system (GCG, Madison 
WI) or Phrap (University of Washington, Seattle WA). Some sequences have been both extended and 
assembled to produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least 

20 interfere with the properties of the original protein, i.e., the structure and especially the function of 
the protein is conserved and not significantly changed by such substitutions. The table below shows 
amino acids which may be substituted for an original amino acid in a protein and which are regarded 
as conservative amino acid substitutions. 





Original Residue 


Conservative Substitution 


25 


Ala 


Gly, Ser 




Arg 


His, Lys 




Asn 


Asp, Gin, His 




Asp 


Asn, Glu 




Cys 


Ala, Ser 


30 


Gin 


Asn, Glu, His 




Glu 


Asp, Gin, His 




Gly 


Ala 




His 


Asn, Arg, Gin, Glu 




lie 


Leu, Val 


35 


Leu 


lie, Val 




Lys 


Arg, Gin, Glu 




Met 


Leu, He 




Phe 


His, Met, Leu, Trp, Tyr 




Ser 


Cys, Thr 


40 


Thr 


Ser, Val 
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genome from which the fragment was obtained. A fragment of SEQ ID NO:9-16 is useful, for 
example, in hybridization and amplification technologies and in analogous methods that distinguish 
SEQ ID NO:9-16 from related polynucleotide sequences. The precise length of a fragment of SEQ 
ID NO:9-16 and the region of SEQ ID NO:9-16 to which the fragment corresponds are routinely 
5 determinable by one of ordinary skill in the art based on the intended purpose for the fragment. 

A fragment of SEQ ID NO: 1-8 is encoded by a fragment of SEQ ID NO.9-16. A fragment of 
SEQ ID NO: 1-8 comprises a region of unique amino acid sequence that specifically identifies SEQ 
ID NO:l-8. For example, a fragment of SEQ ID NO:l-8 is useful as an immunogenic peptide for the 
development of antibodies that specifically recognize SEQ ID NO: 1-8. The precise length of a 
10 fragment of SEQ ID NO: 1-8 and the region of SEQ ID NO: 1-8 to which the fragment corresponds are 
routinely determinable by one of ordinary skill in the art based on the intended purpose for the 
fragment. 

A "full length" polynucleotide sequence is one containing at least a translation initiation 
codon (e.g., methionine) followed by an open reading frame and a translation termination codon. A 
15 "full length" polynucleotide sequence encodes a "full length" polypeptide sequence. 

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between 
two or more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer 
to the percentage of residue matches between at least two polynucleotide sequences aligned using a 
20 standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps 
in the sequences being compared in order to optimize alignment between two sequences, and 
therefore achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 
25 sequence alignment program. This program is part of the LASERGENE software package, a suite of 
molecular biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in 
Higgins, D.G. and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 
8:189-191. For pairwise alignments of polynucleotide sequences, the default parameters are set as 
follows: Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue 
30 weight table is selected as the default. Percent identity is reported by CLUSTAL V as the "percent 
similarity" between aligned polynucleotide sequences. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms 
is provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment 
Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. MoL Biol. 215:403-410), which is available 
35 from several sources, including the NCBI, Bethesda, MD, and on the Internet at 
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parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 
sequence alignment program (described and referenced above). For pairwise alignments of 
polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap 
penalty=3, window=5, and "diagonals saved"=5. The PAM250 matrix is selected as the default 
5 residue weight table. As with polynucleotide alignments, the percent identity is reported by 
CLUSTAL V as the "percent similarity" between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 
2.0.12 (April-2 1-2000) with blastp set at default parameters. Such default parameters may be, for 
10 example: 

Matrix: BLOSUM62 

Open Gap: 11 and Extension Gap: 1 penalties 
Gap x drop-off: 50 
Expect: 10 
15 Word Size: 3 

Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, 
for example, as defined by a particular SEQ ED number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for 

20 instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 
150 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment 
length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be 
used to describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 

25 DNA sequences of about 6 kb to 10 Mb in size and which contain all of the elements required for 
chromosome replication, segregation and maintenance. 

The term "humanized antibody" refers to an antibody molecule in which the amino acid 
sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

30 "Hybridization" refers to the process by which a polynucleotide strand anneals with a 

complementary strand through base pairing under defined hybridization conditions. Specific 
hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 
Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in determining the 

35 stringency of the hybridization process, with more stringent conditions allowing less non-specific 
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of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 
cellular and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of GCREC which is 
capable of eliciting an immune response when introduced into a living organism, for example, a 
5 mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment 
of GCREC which is useful in any of the antibody production methods disclosed herein or known in 
the art. 

The term "microarray" refers to an arrangement of a plurality of polynucleotides, 
polypeptides, or other chemical compounds on a substrate. 
10 The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 

chemical compound having a unique and defined position on a microarray. 

The term "modulate" refers to a change in the activity of GCREC. For example, modulation 
may cause an increase or a decrease in protein activity, binding characteristics, or any other 
biological, functional, or immunological properties of GCREC. 
15 The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 

polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-Iike material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
20 functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
25 comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. 
PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript 
elongation, and may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of an GCREC may involve lipidation, glycosylation, 
30 phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in 
the art. These processes may occur synthetically or biochemically. Biochemical modifications will 
vary by cell type depending on the enzymatic milieu of GCREC. 

"Probe" refers to nucleic acid sequences encoding GCREC, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
35 isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. 
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regions of aligned nucleic acid sequences. Hence, this program is useful for identification of both 
unique and conserved oligonucleotides and polynucleotide fragments. The oligonucleotides and 
polynucleotide fragments identified by any of the above selection methods are useful in hybridization 
technologies, for example, as PCR or sequencing primers, microarray elements, or specific probes to 
5 identify fully or partially complementary polynucleotides in a sample of nucleic acids. Methods of 
oligonucleotide selection are not limited to those described above. 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 
that is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 

10 artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
such as those described in Sambrook, supra . The term recombinant includes nucleic acids that have 
been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a 
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter 
sequence. Such a recombinant nucleic acid may be part of a vector that is used, for example, to 

15 transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
expressed, inducing a protective immunological response in the mammal. 

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
20 regions of a gene and includes enhancers, promoters, introns, and 5' and 3' untranslated regions 
(UTRs). Regulatory elements interact with host or viral proteins which control transcription, 
translation, or RNA stability. 

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
25 chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and 
other moieties known in the art. 

An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 
sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
30 instead of deoxyribose. 

The term "sample" is used in its broadest sense. A sample suspected of containing GCREC, 
nucleic acids encoding GCREC, or fragments thereof may comprise a bodily fluid; an extract from a 
cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or 
cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 
35 The terms "specific binding" and "specifically binding" refer to that interaction between a 
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cyanobacteria, fungi, plants and animals. The isolated DNA of the present invention can be 
introduced into the host by methods known in the art, for example infection, transfection, 
transformation or transconjugation. Techniques for transferring the DNA of the present invention 
into such organisms are widely known and provided in references such as Sambrook et al. (1989), 
5 supra. 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having 
at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of 
the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at 

10 least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 
93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater 
sequence identity over a certain defined length. A variant may be described as, for example, an 
"allelic" (as defined above), "splice," "species," or "polymorphic" variant. A splice variant may have 
significant identity to a reference molecule, but will generally have a greater or lesser number of 

15 polynucleotides due to alternative splicing of exons during mRNA processing. The corresponding 
polypeptide may possess additional functional domains or lack domains that are present in the 
reference molecule. Species variants are polynucleotide sequences that vary from one species to 
another. The resulting polypeptides will generally have significant amino acid identity relative to 
each other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene 

20 between individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide base. The 
presence of SNPs may be indicative of, for example, a certain population, a disease state, or a 
propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 

25 at least 40% sequence identity to the particular polypeptide sequence over a certain length of one of 
the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 90%, at least 91%, at least 92%, at least 93%, at least 
94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater sequence 

30 identity over a certain defined length of one of the polypeptides. 



THE INVENTION 

The invention is based on the discovery of new human G-protein coupled receptors 
(GCREC), the polynucleotides encoding GCREC, and the use of these compositions for the diagnosis, 
35 treatment, or prevention of cell proliferative, neurological, cardiovascular, gastrointestinal, 
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(GenBank ID g2 19867) as determined by BLAST. (See Table 2.) The BLAST probability score is 
3.7e-88. Data from BLAST-DOMO, MOTIFS, BLIMPS-BLOCKS, BLIMPS-PRINTS, 
PROFILESCAN, HMMER-PFAM, and SPSCAN analyses provide further corroborative evidence 
that SEQ ID NO:3 is a G-protein coupled receptor. In an alternative example, SEQ ID NO:8 is 50% 

5 identical to a predicted mouse odorant receptor-like protein (GenBank ID g6532001) as determined 
by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability score 
is 8.4e-75, which indicates the probability of obtaining the observed polypeptide sequence alignment 
by chance. SEQ ED NO:8 also contains a seven transmembrane receptor (rhodopsin family) domain 
as determined by searching for statistically significant matches in the hidden Markov model (HMM)- 

10 based PF AM database of conserved protein family domains. (See Table 3.) Data from BLIMPS, 
MOTIFS, and PROFILESCAN analyses provide further corroborative evidence that SEQ ID NO: 8 is 
a G-protein coupled receptor. SEQ ID NO: 1 and SEQ ED NO:4-7 were analyzed and annotated in a 
similar manner. The algorithms and parameters for the analysis of SEQ ID NO: 1-8 are described in 
Table 7. 

15 As shown in Table 4, the full length polynucleotide sequences of the present invention were 

assembled using cDNA sequences or coding (exon) sequences derived from genomic DNA, or any 
combination of these two types of sequences. Columns 1 and 2 list the polynucleotide sequence 
identification number (Polynucleotide SEQ ID NO:) and the corresponding Incyte polynucleotide 
consensus sequence number (Incyte Polynucleotide ID) for each polynucleotide of the invention. 

20 Column 3 shows the length of each polynucleotide sequence in basepairs. Column 4 lists fragments 
of the polynucleotide sequences which are useful, for example, in hybridization or amplification 
technologies that identify SEQ ID NO.9-16 or that distinguish between SEQ ID NO.9-16 and related 
polynucleotide sequences. Column 5 shows identification numbers corresponding to cDNA 
sequences, coding sequences (exons) predicted from genomic DNA, and/or sequence assemblages 

25 comprised of both cDNA and genomic DNA. These sequences were used to assemble the full length 
polynucleotide sequences of the invention. Columns 6 and 7 of Table 4 show the nucleotide start (5') 
and stop (3') positions of the cDNA and/or genomic sequences in column 5 relative to their respective 
full length sequences. 

The identification numbers in Column 5 of Table 4 may refer specifically, for example, to 
30 Incyte cDNAs along with their corresponding cDNA libraries. For example, 625 1915H1 is the 
identification number of an Incyte cDNA sequence, and LUNPTUT02 is the cDNA library from 
which it is derived. Incyte cDNAs for which cDNA libraries are not indicated were derived from 
pooled cDNA libraries (e.g., 7081 2700V 1). Alternatively, the identification numbers in column 5 
may refer to GenBank cDNAs or ESTs (e.g., g5663306) which contributed to the assembly of the full 
35 length polynucleotide sequences. Alternatively, the identification numbers in column 5 may refer to 
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polynucleotide sequence comprising a sequence selected from the group consisting of SEQ ID NO:9- 
16 which has at least about 70%, or alternatively at least about 85%, or even at least about 95% 
polynucleotide sequence identity to a nucleic acid sequence selected from the group consisting of 
SEQ ID NO:9-16. Any one of the polynucleotide variants described above can encode an amino acid 
5 sequence which contains at least one functional or structural characteristic of GCREC. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the 
genetic code, a multitude of polynucleotide sequences encoding GCREC, some bearing minimal 
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 

10 sequence that could be made by selecting combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as applied to the 
polynucleotide sequence of naturally occurring GCREC, and all such variations are to be considered 
as being specifically disclosed. 

Although nucleotide sequences which encode GCREC and its variants are generally capable 

15 of hybridizing to the nucleotide sequence of the naturally occurring GCREC under appropriately 
selected conditions of stringency, it may be advantageous to produce nucleotide sequences encoding 
GCREC or its derivatives possessing a substantially different codon usage, e.g., inclusion of non- 
naturally occurring codons. Codons may be selected to increase the rate at which expression of the 
peptide occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with 

20 which particular codons are utilized by the host. Other reasons for substantially altering the 

nucleotide sequence encoding GCREC and its derivatives without altering the encoded amino acid 
sequences include the production of RNA transcripts having more desirable properties, such as a 
greater half-life, than transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode GCREC and 

25 GCREC derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic sequence may be inserted into any of the many available expression vectors and cell 
systems using reagents well known in the art. Moreover, synthetic chemistry may be used to 
introduce mutations into a sequence encoding GCREC or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 

30 hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO:9-16 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and 
S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 
152:507-51 1.) Hybridization conditions, including annealing and wash conditions, are described in 
"Definitions." 

35 Methods for DNA sequencing are well known in the art and may be used to practice any of 
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size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5* regions of genes, are preferable for situations in which an oligo d(T) 
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence 
into 5' non-transcribed regulatory regions. 
5 Capillary electrophoresis systems which are commercially available may be used to analyze 

the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 

10 software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire 
process from loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 
which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof 

15 which encode GCREC may be cloned in recombinant DNA molecules that direct expression of 

GCREC, or fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent 
degeneracy of the genetic code, other DNA sequences which encode substantially the same or a 
functionally equivalent amino acid sequence may be produced and used to express GCREC. 

The nucleotide sequences of the present invention can be engineered using methods generally 

20 known in the art in order to alter GCREC-encoding sequences for a variety of purposes including, but 
not limited to, modification of the cloning, processing, and/or expression of the gene product. DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic 
oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide- 
mediated site-directed mutagenesis may be used to introduce mutations that create new restriction 

25 sites, alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques such 
as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 
5,837,458; Chang, C-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat. 
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or 

30 improve the biological properties of GCREC, such as its biological or enzymatic activity or its ability 
to bind to other molecules or compounds. DNA shuffling is a process by which a library of gene 
variants is produced using PCR-mediated recombination of gene fragments. The library is then 
subjected to selection or screening procedures that identify those gene variants with the desired 
properties. These preferred variants may then be pooled and further subjected to recursive rounds of 

35 DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial" 
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may be enhanced by the inclusion of enhancers appropriate for the particular host cell system used. 
(See, e.g., Scharf, D. et al. (1994) Results ProbL Cell Differ. 20:125-162.) 

Methods which are well known to those skilled in the art may be used to construct expression 
vectors containing sequences encoding GCREC and appropriate transcriptional and translational 
5 control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, 
and in vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning, A 
Laboratory Manual Cold Spring Harbor Press, Plain view NY, ch. 4, 8, and 16-17; Ausubel, F.M. et 
al. (1995) Current Protocols in Molecular Biology , John Wiley & Sons, New York NY, ch. 9, 13, and 
16.) 

10 A variety of expression vector/host systems may be utilized to contain and express sequences 

encoding GCREC. These include, but are not limited to, microorganisms such as bacteria 
transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast 
transformed with yeast expression vectors; insect cell systems infected with viral expression vectors 
(e.g., baculovirus); plant cell systems transformed with viral expression vectors (e.g., cauliflower 

15 mosaic virus, CaMV, or tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or 
pBR322 plasmids); or animal cell systems. (See, e.g., Sambrook, supra ; Ausubel, supra ; Van Heeke, 
G. and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509; Engelhard, E.K. et al. (1994) Proc. Natl. 
Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945; Takamatsu, 
N. (1987) EMBO J. 6:307-3 1 1 ; The McGraw Hill Yearbook of Science and Technology (1992) 

20 McGraw Hill, New York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 
81:3655-3659; and Harrington, JJ. et al. (1997) Nat. Genet. 15:345-355.) Expression vectors derived 
from retroviruses, adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, 
may be used for delivery of nucleotide sequences to the targeted organ, tissue, or cell population. 
(See, e.g., Di Nicola, M. et al. (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et al. (1993) Proc. 

25 Natl. Acad. Sci. USA 90(1 3): 6340-6344; Buller, R.M. et al. (1985) Nature 317(6040):813-815; 
McGregor, D.P. et al. (1994) Mol. Immunol. 3 1(3):2 19-226; and Verma, I.M. and N. Somia (1997) 
Nature 389:239-242.) The invention is not limited by the host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected depending 
upon the use intended for polynucleotide sequences encoding GCREC. For example, routine cloning, 

30 subcloning, and propagation of polynucleotide sequences encoding GCREC can be achieved using a 
multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 
plasmid (Life Technologies). Ligation of sequences encoding GCREC into the vector's multiple 
cloning site disrupts the lacL gene, allowing a colorimetric screening procedure for identification of 
transformed bacteria containing recombinant molecules. In addition, these vectors may be useful for 

35 in vitro transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of 
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into cell lines using expression vectors which may contain viral origins of replication and/or 
endogenous expression elements and a selectable marker gene on the same or on a separate vector. 
Following the introduction of the vector, cells may be allowed to grow for about 1 to 2 days in 
enriched media before being switched to selective media. The purpose of the selectable marker is to 
5 confer resistance to a selective agent, and its presence allows growth and recovery of cells which 
successfully express the introduced sequences. Resistant clones of stably transformed cells may be 
propagated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These 
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 

10 phosphoribosyltransferase genes, for use in tk' and apf cells, respectively. (See, e.g., Wigler, M. et 
al. (1977) Cell 1 1 :223-232; Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, 
or herbicide resistance can be used as the basis for selection. For example, dhfr confers resistance to 
methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and pat 
confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., 

15 Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) 
J. Mol. Biol. 150:1-14.) Additional selectable genes have been described, e.g., trpB and hisD y which 
alter cellular requirements for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. 
Natl. Acad. Sci. USA 85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins 
(GFP; Clontech), 8 glucuronidase and its substrate B-glucuronide, or luciferase and its substrate 

20 luciferin may be used. These markers can be used not only to identify transformants, but also to 
quantify the amount of transient or stable protein expression attributable to a specific vector system. 
(See, e.g., Rhodes, C.A. (1995) Methods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of interest is 
also present, the presence and expression of the gene may need to be confirmed. For example, if the 

25 sequence encoding GCREC is inserted within a marker gene sequence, transformed cells containing 
sequences encoding GCREC can be identified by the absence of marker gene function. Alternatively, 
a marker gene can be placed in tandem with a sequence encoding GCREC under the control of a 
single promoter. Expression of the marker gene in response to induction or selection usually 
indicates expression of the tandem gene as well. 

30 In general, host cells that contain the nucleic acid sequence encoding GCREC and that 

express GCREC may be identified by a variety of procedures known to those of skill in the art. These 
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or 
chip based technologies for the detection and/or quantification of nucleic acid or protein sequences. 

35 Immunological methods for detecting and measuring the expression of GCREC using either 
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modification and processing of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding GCREC may be ligated to a heterologous sequence resulting in translation of a 
fusion protein in any of the aforementioned host systems. For example, a chimeric GCREC protein 
5 containing a heterologous moiety that can be recognized by a commercially available antibody may 
facilitate the screening of peptide libraries for inhibitors of GCREC activity. Heterologous protein 
and peptide moieties may also facilitate purification of fusion proteins using commercially available 
affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), 
maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, 

10 c-myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their 
cognate fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and 
metal-chelate resins, respectively. FLAG, c-wyc, and hemagglutinin (HA) enable immunoaffinity 
purification of fusion proteins using commercially available monoclonal and polyclonal antibodies 
that specifically recognize these epitope tags. A fusion protein may also be engineered to contain a 

15 proteolytic cleavage site located between the GCREC encoding sequence and the heterologous 
protein sequence, so that GCREC may be cleaved away from the heterologous moiety following 
purification. Methods for fusion protein expression and purification are discussed in Ausubel (1995, 
supra , ch. 10). A variety of commercially available kits may also be used to facilitate expression and 
purification of fusion proteins. 

20 In a further embodiment of the invention, synthesis of radiolabeled GCREC may be achieved 

in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These 
systems couple transcription and translation of protein-coding sequences operably associated with the 
T7, T3, or SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid 
precursor, for example, 35 S-methionine. 

25 GCREC of the present invention or fragments thereof may be used to screen for compounds 

that specifically bind to GCREC. At least one and up to a plurality of test compounds may be 
screened for specific binding to GCREC. Examples of test compounds include antibodies, 
oligonucleotides, proteins (e.g., receptors), or small molecules. 

In one embodiment, the compound thus identified is closely related to the natural ligand of 

30 GCREC, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 
natural binding partner. (See, e.g., Coligan, J.E. et al. (1991) Current Protocols in Immunology 1(2): 
Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which GCREC 
binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, the 
compound can be rationally designed using known techniques. In one embodiment, screening for 

35 these compounds involves producing appropriate cells which express GCREC, either as a secreted 
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blastocysts such as those from the C57BL/6 mouse strain. The blastocysts are surgically transferred 
to pseudopregnant dams, and the resulting chimeric progeny are genotyped and bred to produce 
heterozygous or homozygous strains. Transgenic animals thus generated may be tested with potential 
therapeutic or toxic agents. 
5 Polynucleotides encoding GCREC may also be manipulated in vitro in ES cells derived from 

human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. 
(1998) Science 282:1145-1147). 

10 Polynucleotides encoding GCREC can also be used to create "knockin" humanized animals 

(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a 
region of a polynucleotide encoding GCREC is injected into animal ES cells, and the injected 
sequence integrates into the animal cell genome. Transformed cells are injected into blastulae, and 
the blastulae are implanted as described above. Transgenic progeny or inbred lines are studied and 

15 treated with potential pharmaceutical agents to obtain information on treatment of a human disease. 
Alternatively, a mammal inbred to overexpress GCREC, e.g., by secreting GCREC in its milk, may 
also serve as a convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55- 
74). 

THERAPEUTICS 

20 Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 

between regions of GCREC and G-protein coupled receptors. In addition, the expression of GCREC 
is closely associated with brain tumor, breast tumor, liver, fetal kidney, and fetal thymus tissue. 
Therefore, GCREC appears to play a role in cell proliferative, neurological, cardiovascular, 
gastrointestinal, autoimmune/inflammatory, and metabolic disorders, and viral infections. In the 

25 treatment of disorders associated with increased GCREC expression or activity, it is desirable to 
decrease the expression or activity of GCREC. In the treatment of disorders associated with 
decreased GCREC expression or activity, it is desirable to increase the expression or activity of 
GCREC. 

Therefore, in one embodiment, GCREC or a fragment or derivative thereof may be 
30 administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of GCREC. Examples of such disorders include, but are not limited to, a cell proliferative 
disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed 
connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, 
polycythemia vera, psoriasis, primary thrombocythemia, and cancers including adenocarcinoma, 
35 leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of 
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carcinoma, biliary tract disease, hepatitis, hyperbilirubinemia, cirrhosis, passive congestion of the 
liver, hepatoma, infectious colitis, ulcerative colitis, ulcerative proctitis, Crohn's disease, Whipple's 
disease, Mallory-Weiss syndrome, colonic carcinoma, colonic obstruction, irritable bowel syndrome, 
short bowel syndrome, diarrhea, constipation, gastrointestinal hemorrhage, acquired 
5 immunodeficiency syndrome (AIDS) enteropathy, jaundice, hepatic encephalopathy, hepatorenal 
syndrome, hepatic steatosis, hemochromatosis, Wilson's disease, alpha r antitrypsin deficiency, 
Reye's syndrome, primary sclerosing cholangitis, liver infarction, portal vein obstruction and 
thrombosis, centrilobular necrosis, peliosis hepatis, hepatic vein thrombosis, veno-occlusive disease, 
preeclampsia, eclampsia, acute fatty liver of pregnancy, intrahepatic cholestasis of pregnancy, and 

10 hepatic tumors including nodular hyperplasias, adenomas, and carcinomas; an 

autoimmune/inflammatory disorder such as acquired immunodeficiency syndrome (AIDS), 
Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, 
anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, 
autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, 

15 cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes 
mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, 
erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' 
disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, 
myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, 

20 polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, 
systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, 
ulcerative colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and 
extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and 
trauma; a metabolic disorder such as diabetes, obesity, and osteoporosis; and an infection by a viral 

25 agent classified as adenovirus, arenavirus, bunyavims, calicivirus, coronavirus, filovirus, 

hepadnavirus, herpesvirus, flavivirus, orthomyxovirus, parvovirus, papovavirus, paramyxovirus, 
picornavirus, poxvirus, reovirus, retrovirus, rhabdovirus, and tongavirus. 

In another embodiment, a vector capable of expressing GCREC or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased 

30 expression or activity of GCREC including, but not limited to, those described above. 

In a further embodiment, a composition comprising a substantially purified GCREC in 
conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent 
a disorder associated with decreased expression or activity of GCREC including, but not limited to, 
those provided above. 
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GCREC have an amino acid sequence consisting of at least about 5 amino acids, and generally will 
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or 
fragments are identical to a portion of the amino acid sequence of the natural protein. Short stretches 
of GCREC amino acids may be fused with those of another protein, such as KLH, and antibodies to 
5 the chimeric molecule may be produced. 

Monoclonal antibodies to GCREC may be prepared using any technique which provides for 
the production of antibody molecules by continuous cell lines in culture. These include, but are not 
limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma 
technique. (See, e.g., Kohler, G. et aL (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. 

10 Immunol. Methods 81:31-42; Cote, RJ. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and 
Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc. 

15 Natl. Acad. Sci. USA 81 :685 1-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda, 
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single 
chain antibodies may be adapted, using methods known in the art, to produce GCREC-specific single 
chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be 
generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g., 

20 Burton, D.R. (1991) Proc. Natl. Acad. Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte 
population or by screening immunoglobulin libraries or panels of highly specific binding reagents as 
disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299.) 

25 Antibody fragments which contain specific binding sites for GCREC may also be generated. 

For example, such fragments include, but are not limited to, F(ab') 2 fragments produced by pepsin 
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of 
the F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and 
easy identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. 

30 et al. (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation between GCREC and its 

35 specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
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complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g., 
Slater, J.E. et al. (1998) J. Allergy Cli. Immunol. 102(3):469-475; and Scanlon, K.J. et al. (1995) 
9(13):1288-1296.) Antisense sequences can also be introduced intracellularly through the use of viral 
vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g., Miller, A.D. (1990) Blood 
5 76:271; Ausubel, supra; Uckert, W. and W. Walther (1994) Pharmacol. Then 63(3):323-347.) Other 
gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other 
systems known in the art. (See, e.g., Rossi, JJ. (1995) Br. Med. Bull. 51(l):217-225; Boado, RJ. et 
al. (1998) J. Pharm. Sci. 87(11):1308-1315; and Morris, M.C. et al. (1997) Nucleic Acids Res. 
25(14):2730-2736.) 

10 In another embodiment of the invention, polynucleotides encoding GCREC may be used for 

somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 
(e.g., in the cases of severe combined immunodeficiency (SCDD)-Xl disease characterized by X- 
linked inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined 
immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency 

15 (Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475), 
cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Hum. Gene 
Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), thalassamias, familial 
hypercholesterolemia, and hemophilia resulting from Factor VIII or Factor EX deficiencies (Crystal, 
R.G. (1995) Science 270:404-410; Verma, I.M. and N. Somia (1997) Nature 389:239-242)), (ii) 

20 express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
cell proliferation), or (iii) express a protein which affords protection against intracellular parasites 
(e.g., against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. 
(1988) Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sci. USA. 93:1 1395-1 1399), 
hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 

25 brasiliensis ; and protozoan parasites such as Plasmodium falciparum and Trypanosoma cruzi) . In the 
case where a genetic deficiency in GCREC expression or regulation causes disease, the expression of 
GCREC from an appropriate population of transduced cells may alleviate the clinical manifestations 
caused by the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in 

30 GCREC are treated by constructing mammalian expression vectors encoding GCREC and introducing 
these vectors by mechanical means into GCREC-deficient cells. Mechanical transfer technologies for 
use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) 
ballistic gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene 
transfer, and (v) the use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. 

35 Biochem. 62:191-217; Ivies, Z. (1997) Cell 91:501-510; Boulay, J-L. and H. Recipon (1998) Curr. 
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scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
5 candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared 
by any method known in the art for the synthesis of nucleic acid molecules. These include techniques 
for chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 

10 Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA 
sequences encoding GCREC. Such DNA sequences may be incorporated into a wide variety of 
vectors with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA 
constructs that synthesize complementary RNA, constitutively or inducibly, can be introduced into 
cell lines, cells, or tissues. 

15 RNA molecules may be modified to increase intracellular stability and half-life. Possible 

modifications include, but are not limited to, the addition of flanking sequences at the 5 'and/or 3' 
ends of the molecule, or the use of phosphorothioate or 2'0-methyl rather than phosphodiesterase 
linkages within the backbone of the molecule. This concept is inherent in the production of PNAs 
and can be extended in all of these molecules by the inclusion of nontraditional bases such as inosine, 

20 queosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, 
cytidine, guanine, thymine, and uridine which are not as easily recognized by endogenous 
endonucleases. 

An additional embodiment of the invention encompasses a method for screening for a 
compound which is effective in altering expression of a polynucleotide encoding GCREC. 

25 Compounds which may be effective in altering expression of a specific polynucleotide may include, 
but are not limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming 
oligonucleotides, transcription factors and other polypeptide transcriptional regulators, and non- 
macromolecular chemical entities which are capable of interacting with specific polynucleotide 
sequences. Effective compounds may alter polynucleotide expression by acting as either inhibitors or 

30 promoters of polynucleotide expression. Thus, in the treatment of disorders associated with increased 
GCREC expression or activity, a compound which specifically inhibits expression of the 
polynucleotide encoding GCREC may be therapeutically useful, and in the treatment of disorders 
associated with decreased GCREC expression or activity, a compound which specifically promotes 
expression of the polynucleotide encoding GCREC may be therapeutically useful. 

35 At least one, and up to a plurality, of test compounds may be screened for effectiveness in 
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excipient. Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. 
Various formulations are commonly known and are thoroughly discussed in the latest edition of 
Remington's Pharmaceutical Sciences (Maack Publishing, Easton PA). Such compositions may 
consist of GCREC, antibodies to GCREC, and mimetics, agonists, antagonists, or inhibitors of 
5 GCREC. 

The compositions utilized in this invention may be administered by any number of routes 
including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, 
intrathecal, intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, intranasal, 
enteral, topical, sublingual, or rectal means. 

10 Compositions for pulmonary administration may be prepared in liquid or dry powder form. 

These compositions are generally aerosolized immediately prior to inhalation by the patient. In the 
case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of 
fast-acting formulations is well-known in the art. In the case of macromolecules (e.g. larger peptides 
and proteins), recent developments in the field of pulmonary delivery via the alveolar region of the 

15 lung have enabled the practical delivery of drugs such as insulin to blood circulation (see, e.g., Patton, 
J.S. et al., U.S. Patent No. 5,997,848). Pulmonary delivery has the advantage of administration 
without needle injection, and obviates the need for potentially toxic penetration enhancers. 

Compositions suitable for use in the invention include compositions wherein the active 
ingredients are contained in an effective amount to achieve the intended purpose. The determination 

20 of an effective dose is well within the capability of those skilled in the art. 

Specialized forms of compositions may be prepared for direct intracellular delivery of 
macromolecules comprising GCREC or fragments thereof. For example, liposome preparations 
containing a cell-impermeable macromolecule may promote cell fusion and intracellular delivery of 
the macromolecule. Alternatively, GCREC or a fragment thereof may be joined to a short cationic N- 

25 terminal portion from the HIV Tat-1 protein. Fusion proteins thus generated have been found to 

transduce into the cells of all tissues, including the brain, in a mouse model system (Schwarze, S.R. et 
al. (1999) Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, 

30 monkeys, or pigs. An animal model may also be used to determine the appropriate concentration 
range and route of administration. Such information can then be used to determine useful doses and 
routes for administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example 
GCREC or fragments thereof, antibodies of GCREC, and agonists, antagonists or inhibitors of 

35 GCREC, which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be 
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extracts taken from normal mammalian subjects, for example, human subjects, with antibodies to 
GCREC under conditions suitable for complex formation. The amount of standard complex 
formation may be quantitated by various methods, such as photometric means. Quantities of GCREC 
expressed in subject, control, and disease samples from biopsied tissues are compared with the 
5 standard values. Deviation between standard and subject values establishes the parameters for 
diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding GCREC may be used 
for diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect 

10 and quantify gene expression in biopsied tissues in which expression of GCREC may be correlated 
with disease. The diagnostic assay may be used to determine absence, presence, and excess 
expression of GCREC, and to monitor regulation of GCREC levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding GCREC or closely related molecules may be used 

15 to identify nucleic acid sequences which encode GCREC. The specificity of the probe, whether it is 
made from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a 
conserved motif, and the stringency of the hybridization or amplification will determine whether the 
probe identifies only naturally occurring sequences encoding GCREC, allelic variants, or related 
sequences. 

20 Probes may also be used for the detection of related sequences, and may have at least 50% 

sequence identity to any of the GCREC encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:9-16 or from 
genomic sequences including promoters, enhancers, and introns of the GCREC gene. 

Means for producing specific hybridization probes for DNAs encoding GCREC include the 

25 cloning of polynucleotide sequences encoding GCREC or GCREC derivatives into vectors for the 

production of mRNA probes. Such vectors are known in the art, are commercially available, and may 
be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a 
variety of reporter groups, for example, by radionuclides such as 32 P or 35 S, or by enzymatic labels, 

30 such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding GCREC may be used for the diagnosis of disorders 
associated with expression of GCREC. Examples of such disorders include, but are not limited to, a 
cell proliferative disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, 
hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 

35 hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including 
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obstruction, infections of the intestinal tract, peptic ulcer, cholelithiasis, cholecystitis, cholestasis, 
pancreatitis, pancreatic carcinoma, biliary tract disease, hepatitis, hyperbilirubinemia, cirrhosis, 
passive congestion of the liver, hepatoma, infectious colitis, ulcerative colitis, ulcerative proctitis, 
Crohn's disease, Whipple's disease, Mallory- Weiss syndrome, colonic carcinoma, colonic 
5 obstruction, irritable bowel syndrome, short bowel syndrome, diarrhea, constipation, gastrointestinal 
hemorrhage, acquired immunodeficiency syndrome (AIDS) enteropathy, jaundice, hepatic 
encephalopathy, hepatorenal syndrome, hepatic steatosis, hemochromatosis, Wilson's disease, alpha 
antitrypsin deficiency, Reye's syndrome, primary sclerosing cholangitis, liver infarction, portal vein 
obstruction and thrombosis, centrilobular necrosis, peliosis hepatis, hepatic vein thrombosis, veno- 

10 occlusive disease, preeclampsia, eclampsia, acute fatty liver of pregnancy, intrahepatic cholestasis of 
pregnancy, and hepatic tumors including nodular hyperplasias, adenomas, and carcinomas; an 
autoimmune/inflammatory disorder such as acquired immunodeficiency syndrome (AIDS), 
Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, 
anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, 

15 autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, 
cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes 
mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, 
erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' 
disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, 

20 myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, 
polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, 
systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, 
ulcerative colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and 
extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and 

25 trauma; a metabolic disorder such as diabetes, obesity, and osteoporosis; and an infection by a viral 
agent classified as adenovirus, arenavirus, bunyavirus, calicivirus, coronavirus, filovirus, 
hepadnavirus, herpesvirus, flavivirus, orthomyxovirus, parvovirus, papovavirus, paramyxovirus, 
picornavirus, poxvirus, reovirus, retrovirus, rhabdovirus, and tongavirus. The polynucleotide 
sequences encoding GCREC may be used in Southern or northern analysis, dot blot, or other 

30 membrane-based technologies; in PCR technologies; in dipstick, pin, and multiformat ELISA-like 
assays; and in microarrays utilizing fluids or tissues from patients to detect altered GCREC 
expression. Such qualitative or quantitative methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding GCREC may be useful in assays 
that detect the presence of associated disorders, particularly those mentioned above. The nucleotide 

35 sequences encoding GCREC may be labeled by standard methods and added to a fluid or tissue 
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encoding GCREC may be used to detect single nucleotide polymorphisms (SNPs). SNPs are 
substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic 
disease in humans. Methods of SNP detection include, but are not limited to, single-stranded 
conformation polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, 
5 oligonucleotide primers derived from the polynucleotide sequences encoding GCREC are used to 
amplify DNA using the polymerase chain reaction (PCR). The DNA may be derived, for example, 
from diseased or normal tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause 
differences in the secondary and tertiary structures of PCR products in single-stranded form, and 
these differences are detectable using gel electrophoresis in non-denaturing gels. In fSCCP, the 

10 oligonucleotide primers are fluorescently labeled, which allows detection of the amplimers in high- 
throughput equipment such as DNA sequencing machines. Additionally, sequence database analysis 
methods, termed in silico SNP (isSNP), are capable of identifying polymorphisms by comparing the 
sequence of individual overlapping DNA fragments which assemble into a common consensus 
sequence. These computer-based methods filter out sequence variations due to laboratory preparation 

15 of DNA and sequencing errors using statistical models and automated analyses of DNA sequence 
chromatograms. In the alternative, SNPs may be detected and characterized by mass spectrometry 
using, for example, the high throughput MASSARRAY system (Sequenom, Inc., San Diego CA). 

Methods which may also be used to quantify the expression of GCREC include radiolabeling 
or biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 

20 standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. 
et al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be 
accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide of 
interest is presented in various dilutions and a spectrophotometric or colorimetric response gives 
rapid quantitation. 

25 In further embodiments, oligonucleotides or longer fragments derived from any of the 

polynucleotide sequences described herein may be used as elements on a microarray. The microarray 
can be used in transcript imaging techniques which monitor the relative expression levels of large 
numbers of genes simultaneously as described below. The microarray may also be used to identify 
genetic variants, mutations, and polymorphisms. This information may be used to determine gene 

30 function, to understand the genetic basis of a disorder, to diagnose a disorder, to monitor 

progression/regression of disease as a function of gene expression, and to develop and monitor the 
activities of therapeutic agents in the treatment of disease. In particular, this information may be used 
to develop a pharmacogenomic profile of a patient in order to select the most appropriate and 
effective treatment regimen for that patient. For example, therapeutic agents which are highly 

35 effective and display the fewest side effects may be selected for a patient based on his/her 
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prediction of toxicity. (See, for example, Press Release 00-02 from the National Institute of 
Environmental Health Sciences, released February 29, 2000, available at 
http://www.niehs.nih.gov/oc/news/toxchip.htm.) Therefore, it is important and desirable in 
toxicological screening using toxicant signatures to include all expressed gene sequences. 
5 In one embodiment, the toxicity of a test compound is assessed by treating a biological 

sample containing nucleic acids with the test compound. Nucleic acids that are expressed in the 
treated biological sample are hybridized with one or more probes specific to the polynucleotides of 
the present invention, so that transcript levels corresponding to the polynucleotides of the present 
invention may be quantified. The transcript levels in the treated biological sample are compared with 

10 levels in an untreated biological sample. Differences in the transcript levels between the two samples 
are indicative of a toxic response caused by the test compound in the treated sample. 

Another particular embodiment relates to the use of the polypeptide sequences of the present 
invention to analyze the proteome of a tissue or cell type. The term proteome refers to the global 
pattern of protein expression in a particular tissue or cell type. Each protein component of a 

15 proteome can be subjected individually to further analysis. Proteome expression patterns, or profiles, 
are analyzed by quantifying the number of expressed proteins and their relative abundance under 
given conditions and at a given time. A profile of a cell's proteome may thus be generated by 
separating and analyzing the polypeptides of a particular tissue or cell type. In one embodiment, the 
separation is achieved using two-dimensional gel electrophoresis, in which proteins from a sample are 

20 separated by isoelectric focusing in the first dimension, and then according to molecular weight by 
sodium dodecyl sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson, 
supra) . The proteins are visualized in the gel as discrete and uniquely positioned spots, typically by 
staining the gel with an agent such as Coomassie Blue or silver or fluorescent stains. The optical 
density of each protein spot is generally proportional to the level of the protein in the sample. The 

25 optical densities of equivalently positioned protein spots from different samples, for example, from 
biological samples either treated or untreated with a test compound or therapeutic agent, are 
compared to identify any changes in protein spot density related to the treatment. The proteins in the 
spots are partially sequenced using, for example, standard methods employing chemical or enzymatic 
cleavage followed by mass spectrometry. The identity of the protein in a spot may be determined by 

30 comparing its partial sequence, preferably of at least 5 contiguous amino acid residues, to the 
polypeptide sequences of the present invention. In some cases, further sequence data may be 
obtained for definitive protein identification. 

A proteomic profile may also be generated using antibodies specific for GCREC to quantify 
the levels of GCREC expression. In one embodiment, the antibodies are used as elements on a 

35 microarray, and protein expression levels are quantified by exposing the microarray to the sample and 
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In another embodiment of the invention, nucleic acid sequences encoding GCREC may be 
used to generate hybridization probes useful in mapping the naturally occurring genomic sequence. 
Either coding or noncoding sequences may be used, and in some instances, noncoding sequences may 
be preferable over coding sequences. For example, conservation of a coding sequence among 
5 members of a multi-gene family may potentially cause undesired cross hybridization during 

chromosomal mapping. The sequences may be mapped to a particular chromosome, to a specific 
region of a chromosome, or to artificial chromosome constructions, e.g., human artificial 
chromosomes (HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes 
(BACs), bacterial PI constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. 

10 et al. (1997) Nat. Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. 

(1991) Trends Genet. 7:149-154.) Once mapped, the nucleic acid sequences of the invention may be 
used to develop genetic linkage maps, for example, which correlate the inheritance of a disease state 
with the inheritance of a particular chromosome region or restriction fragment length polymorphism 
(RFLP). (See, for example, Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad. Sci. USA 83:7353- 

15 7357.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic 
map data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra , pp. 965-968.) Examples of genetic 
map data can be found in various scientific journals or at the Online Mendelian Inheritance in Man 
(OMIM) World Wide Web site. Correlation between the location of the gene encoding GCREC on a 

20 physical map and a specific disorder, or a predisposition to a specific disorder, may help define the 
region of DNA associated with that disorder and thus may further positional cloning efforts. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, 

25 may reveal associated markers even if the exact chromosomal locus is not known. This information is 
valuable to investigators searching for disease genes using positional cloning or other gene discovery 
techniques. Once the gene or genes responsible for a disease or syndrome have been crudely 
localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 1 lq22-23, 
any sequences mapping to that area may represent associated or regulatory genes for further 

30 investigation. (See, e.g., Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide sequence of 
the instant invention may also be used to detect differences in the chromosomal location due to 
translocation, inversion, etc., among normal, carrier, or affected individuals. 

In another embodiment of the invention, GCREC, its catalytic or immunogenic fragments, or 
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 

35 screening techniques. The fragment employed in such screening may be free in solution, affixed to a 
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Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A)+ RNA was isolated 
using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN, 
Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was 
5 isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA 
purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP 
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the 

10 recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra , units 
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 
oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the 
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300- 
1000 bp) using SEPHACRYL S 1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column 

15 chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs 
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., 
PBLUESCRBPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), PCDNA2.1 plasmid 
(Invitrogen, Carlsbad CA), PBK-CMV plasmid (Stratagene), or pINCY (Incyte Genomics, Palo Alto 
CA), or derivatives thereof. Recombinant plasmids were transformed into competent E. coli cells 

20 including XL 1 -Blue, XLl-BlueMRF, or SOLR from Stratagene or DH5a, DH10B, or ElectroMAX 
DH10B from Life Technologies. 
II. Isolation of cDNA Clones 

Plasmids obtained as described in Example I were recovered from host cells by in vivo 
excision using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using 

25 at least one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an 
AGTC Miniprep purification kit (Edge Biosy stems, Gaithersburg MD); and QIAWELL 8 Plasmid, 
QIAWELL 8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E.A.L. PREP 96 
plasmid purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 
ml of distilled water and stored, with or without lyophilization, at 4°C. 

30 Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 

high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 
384-weIl plates, and the concentration of amplified plasmid DNA was quantified fluorometrically 
using PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN E fluorescence 

35 scanner (Labsystems Oy, Helsinki, Finland). 



62 



WO 01/87937 



III. 



15 



-« A-*- „ were fenced as follows, 

such as the ABI CATALYST 800 HYDRA microdispenser (Robbins Scientific) 

using reagent proved by Amersh ^ Biosys(enls) . 

10 „ere c»ied out using the „ ^ „ corijm ction with standard ABI 

MlM1 Cffl.in*-l'- < * W j taow „ in the art. Reading 

1997,suEm. unlt7,7 ^ ^ omcui 

disclosed in Example Vm. inrvte cDNAs were validated by removing 

vector, .into, and po!y(A) se,»e„ces and by masta J ^ ^ 

tacyte cDNA sequences or translations thereof were " '° "^° rafc M(J ^i^ote databases, and 
BLOCKS, PRINTS, DOMO, PRODUM, ^ ^ 

s00 ct.resof gene families. S ">< ^ ^' ^ pASTA, BLIMPS, and HMMER. Tb. 

assess to full length. Assembly was pe fo ^ ^ 

,0 QeneMark, BLAST, and FASTA. T*W *P ^^ely, a polypeptide of the inventton 
t „e corresponding fnl, leng* polypepu* „ lsngth 

B^KS.PRINTS.DOMO.PRODOM.Prosi., 
GenBank protein databases (genpepO, Sw.ssPr m ^ 

a „d hidden Markov nude. (HMM)-based protetn tarty databases 



20 



35 



63 



WO 01/87937 



PCT/US01/16285 



polynucleotide sequences are also analyzed using MACDNASIS PRO software (Hitachi Software 
Engineering, South San Francisco CA) and LASERGENE software (DNASTAR). Polynucleotide 
and polypeptide sequence alignments are generated using default parameters specified by the 
CLUSTAL algorithm as incorporated into the MEGALIGN multisequence alignment program 
5 (DNASTAR), which also calculates the percent identity between aligned sequences. 

Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly of 
Incyte cDNA and full length sequences and provides applicable descriptions, references, and 
threshold parameters. The first column of Table 7 shows the tools, programs, and algorithms used, 
the second column provides brief descriptions thereof, the third column presents appropriate 

10 references, all of which are incorporated by reference herein in their entirety, and the fourth column 
presents, where applicable, the scores, probability values, and other parameters used to evaluate the 
strength of a match between two sequences (the higher the score or the lower the probability value, 
the greater the identity between two sequences). 

The programs described above for the assembly and analysis of full length polynucleotide 

15 and polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ 
ID NO:9-16. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization 
and amplification technologies are described in Table 4, column 4. 
IV. Identification and Editing of Coding Sequences from Genomic DNA 

Putative G-protein coupled receptors were initially identified by running the Genscan gene 

20 identification program against public genomic sequence databases (e.g., gbpri and gbhtg). Genscan is 
a general-purpose gene identification program which analyzes genomic DNA sequences from a 
variety of organisms (See Burge, C. and S. Karlin (1997) J. Mol. Biol. 268:78-94, and Burge, C. and 
S. Karlin (1998) Curr. Opin. Struct. Biol. 8:346-354). The program concatenates predicted exons to 
form an assembled cDNA sequence extending from a methionine to a stop codon. The output of 

25 Genscan is a FASTA database of polynucleotide and polypeptide sequences. The maximum range of 
sequence for Genscan to analyze at once was set to 30 kb. To determine which of these Genscan 
predicted cDNA sequences encode G-protein coupled receptors, the encoded polypeptides were 
analyzed by querying against PFAM models for G-protein coupled receptors. Potential G-protein 
coupled receptors were also identified by homology to Incyte cDNA sequences that had been 

30 annotated as G-protein coupled receptors. These selected Genscan-predicted sequences were then 
compared by BLAST analysis to the genpept and gbpri public databases. Where necessary, the 
Genscan-predicted sequences were then edited by comparison to the top BLAST hit from genpept to 
correct errors in the sequence predicted by Genscan, such as extra or omitted exons. BLAST analysis 
was also used to find any Incyte cDNA or public cDNA coverage of the Genscan-predicted 

35 sequences, thus providing evidence for transcription. When Incyte cDNA coverage was available, 
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GenBank protein homology the chimeric protein, or both were used as probes to search for 
homologous genomic sequences from the public human genome databases. Partial DNA sequences 
were therefore "stretched" or extended by the addition of homologous genomic sequences. The 
resultant stretched sequences were examined to determine whether it contained a complete gene. 
5 VI. Chromosomal Mapping of GCREC Encoding Polynucleotides 

The sequences which were used to assemble SEQ ID NO:9-16 were compared with 
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 
implementations of the Smith-Waterman algorithm. Sequences from these databases that matched 
SEQ ID NO:9-16 were assembled into clusters of contiguous and overlapping sequences using 

10 assembly algorithms such as Phrap (Table 7). Radiation hybrid and genetic mapping data available 
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 
Genome Research (WIGR), and Genethon were used to determine if any of the clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 
of all sequences of that cluster, including its particular SEQ ID NO:, to that map location. 

15 Map locations are represented by ranges, or intervals, of human chromosomes. The map 

position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's p- 
arm. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies between 
chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in 
humans, although this can vary widely due to hot and cold spots of recombination.) The cM 

20 distances are based on genetic markers mapped by Genethon which provide boundaries for radiation 
hybrid markers whose sequences were included in each of the clusters. Human genome maps and 
other resources available to the public, such as the NCBI "GeneMap^" World Wide Web site 
(http://www.ncbi.nlm.nih.gov/genemap/), can be employed to determine if previously identified 
disease genes map within or in proximity to the intervals indicated above. 

25 VIL Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 
gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs 
from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra , ch. 7; Ausubel 
(1995) supra , ch.4and 16.) 

30 Analogous computer techniques applying BLAST were used to search for identical or related 

molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is 
much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the 
computer search can be modified to determine whether any particular match is categorized as exact or 
similar. The basis of the search is the product score, which is defined as: 



66 



WO 01/87937 



^ 

5 x minimum {length(Seq. 1), length(Seq. 2)} 



t hoth the degree of similarity between two sequences and the 

s length of the seouence mate, T>* pr ^ ^ ^ Md the 

calculated » Mows: <■» M^T 15 ortH rf , he , wo seqMnces , The BLAST .ore - 

calcuiated b, assign a score +5f J ^ ^ ^ ffip (sep „ ted by 

10 gaps). If there is more than one HSP, then , ^^^^^-t 

identity and 100% overlap. r n?FC are analyzed with respect to the 

tiss »e sources from which they were denved. Fo examp ^ ^ 

assembled, at least in part, with overlapping Incyte ^ ^jman tissue Each human tissue is 
. u r o ,nNA library constructed from a human tissue, c 
20 se^enceisdenvedfromacDNAhb,^ ries: ^.aseuta sysum, connective tts^e; 

genitalla, male; germ cells; hemtc and — ^ Massif—; 
system; pancreas; respirator, system; sense organs, sbn, P ^ 

- oru.ary.rac, ^^-3^--.-----. 

* CaK8 ° neS - ^ L developmental, inflammation, neurologic,, trauma, 
disease/condition categones: cancer, cell hne, P ^ ^ 

information are found in the LIFbbby uu 

M length polynucleotide secmences were als ^ ^ ^ ^ te 
primer was synthesized to initiate 5 extension 

67 



35 



WO 01/87937 



PCT/US01/16285 



synthesized to initiate 3' extension of the known fragment. The initial primers were designed using 
OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 
nucleotides in length, to have a GC content of about 50% or more, and to anneal to the target 
sequence at temperatures of about 68 °C to about 72 °C. Any stretch of nucleotides which would 
5 result in hairpin structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one 
extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art. PCR 
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction 
10 mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2 *, (NH 4 ) 2 S0 4 , 
and 2-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme 
(Life Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for primer 
pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C, 
2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the 
15 alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1 : 94 °C, 3 min; Step 2: 
94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; 
Step 6: 68°C, 5 min; Step 7: storage at 4°C. 

The concentration of DNA in each well was determined by dispensing 100 ^1 PICOGREEN 
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
20 and 0.5 jil of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, 
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II 
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 
concentration of DNA. A 5 }A to 10/^1 aliquot of the reaction mixture was analyzed by 
electrophoresis on a 1 % agarose gel to determine which reactions were successful in extending the 
25 sequence. 

The extended nucleotides were desalted and concentrated, transferred to 3 84- well plates, 
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) 

30 agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones 
were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham 
Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site 
overhangs, and transfected into competent E. coli cells. Transformed cells were selected on 
antibiotic-containing media, and individual colonies were picked and cultured overnight at 37 °C in 

35 384-well plates in LB/2x carb liquid media. 
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supra ). Suggested substrates include silicon, silica, glass slides, glass chips, and silicon wafers. 
Alternatively, a procedure analogous to a dot or slot blot may also be used to arrange and link 
elements to the surface of a substrate using thermal, UV, chemical, or mechanical bonding 
procedures. A typical array may be produced using available methods and machines well known to 
5 those of ordinary skill in the art and may contain any appropriate number of elements. (See, e.g., 
Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645; 
Marshall, A. and J. Hodgson (1998) Nat. Biotechnol. 16:27-31.) 

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may 
comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can be 

10 selected using software well known in the art such as LASERGENE software (DNASTAR). The 

array elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 
biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection. 
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a 
fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser 

15 desorbtion and mass spectrometry may be used for detection of hybridization. The degree of 

complementarity and the relative abundance of each polynucleotide which hybridizes to an element 
on the microarray may be assessed. In one embodiment, microarray preparation and usage is 
described in detail below. 
Tissue or Cell Sample Preparation 

20 Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 

poly(A) + RNA is purified using the oligo-(dT) cellulose method. Each poly(A) + RNA sample is 
reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/|ul oligo-(dT) primer (21mer), IX 
first strand buffer, 0.03 units/pl RNase inhibitor, 500 pM dATP, 500 pM dGTP, 500 |uM dTTP, 40 
HM dCTP, 40 iM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse 

25 transcription reaction is performed in a 25 ml volume containing 200 ng poly (A) + RNA with 

GEMBRIGHT kits (Incyte). Specific control poly(A) + RNAs are synthesized by in vitro transcription 
from non-coding yeast genomic DNA. After incubation at 37 °C for 2 hr, each reaction sample (one 
with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and 
incubated for 20 minutes at 85° C to the stop the reaction and degrade the RNA. Samples are purified 

30 using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. 
(CLONTECH), Palo Alto CA) and after combining, both reaction samples are ethanol precipitated 
using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 ml of 100% ethanol. The sample is 
then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and 
resuspended in 14 |il 5X SSC/0.2% SDS. 

35 Microarray Preparation 
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at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 
focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 
containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with a 
5 resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 

10 emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 
although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentration. A specific location on 

15 the array contains a complementary DNA sequence, allowing the intensity of the signal at that 

location to be correlated with a weight ratio of hybridizing species of 1 : 100,000. When two samples 
from different sources (e.g., representing test and control cells), each labeled with a different 
fluorophore, are hybridized to a single array for the purpose of identifying genes that are 
differentially expressed, the calibration is done by labeling samples of the calibrating cDNA with the 

20 two fluorophores and adding identical amounts of each to the hybridization mixture. 

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital 
(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 
computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high 

25 signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping 
emission spectra) between the fluorophores using each fluorophore* s emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each 
spot is centered in each element of the grid. The fluorescence signal within each element is then 

30 integrated to obtain a numerical value corresponding to the average intensity of the signal. The 
software used for signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 
XI. Complementary Polynucleotides 

Sequences complementary to the GCREC-encoding sequences, or any parts thereof, are used 
to detect, decrease, or inhibit expression of naturally occurring GCREC. Although use of 

35 oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same 
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shown in Examples XVI, XVII, and XVIII, where applicable. 

XIII. Functional Assays 

GCREC function is assessed by expressing the sequences encoding GCREC at 
physiologically elevated levels in mammalian cell culture systems. cDNA is subcloned into a 
5 mammalian expression vector containing a strong promoter that drives high levels of cDNA 

expression. Vectors of choice include PCMV SPORT (Life Technologies) and PCR3.1 (Invitrogen, 
Carlsbad CA), both of which contain the cytomegalovirus promoter. 5-10 n° of recombinant vector 
are transiently transfected into a human cell line, for example, an endothelial or hematopoietic cell 
line, using either liposome formulations or electroporation. 1-2 /j,g of an additional plasmid 

10 containing sequences encoding a marker protein are co-transfected. Expression of a marker protein 
provides a means to distinguish transfected cells from n on transfected cells and is a reliable predictor 
of cDNA expression from the recombinant vector. Marker proteins of choice include, e.g., Green 
Fluorescent Protein (GFP; Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), 
an automated, laser optics-based technique, is used to identify transfected cells expressing GFP or 

15 CD64-GFP and to evaluate the apoptotic state of the cells and other cellular properties. FCM detects 
and quantifies the uptake of fluorescent molecules that diagnose events preceding or coincident with 
cell death. These events include changes in nuclear DNA content as measured by staining of DNA 
with propidium iodide; changes in cell size and granularity as measured by forward light scatter and 
90 degree side light scatter; down-regulation of DNA synthesis as measured by decrease in 

20 bromodeoxyuridine uptake; alterations in expression of cell surface and intracellular proteins as 
measured by reactivity with specific antibodies; and alterations in plasma membrane composition as 
measured by the binding of fluorescein-conjugated Annexin V protein to the cell surface. Methods in 
flow cytometry are discussed in Ormerod, M.G. (1994) Flow Cytometry . Oxford, New York NY. 
The influence of GCREC on gene expression can be assessed using highly purified 

25 populations of cells transfected with sequences encoding GCREC and either CD64 or CD64-GFP. 
CD64 and CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions 
of human immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected 
cells using magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake 
Success NY). mRNA can be purified from the cells using methods well known by those of skill in 

30 the art. Expression of mRNA encoding GCREC and other genes of interest can be analyzed by 
northern analysis or microarray techniques. 

XIV. Production of GCREC Specific Antibodies 

GCREC substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to 
35 immunize rabbits and to produce antibodies using standard protocols. 
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system as described in Fields, S. and O. Song (1989) Nature 340:245-246, or using commercially 
available kits based on the two-hybrid system, such as the MATCHMAKER system (Clontech). 
GCREC may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT) which 
employs the yeast two-hybrid system in a high-throughput manner to determine all interactions 
5 between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. 
Patent No. 6,057,101). 

Potential GCREC agonists or antagonists may be tested for activation or inhibition of 
GCREC receptor activity using the assays described in sections XVII and XVIII. Candidate 
molecules may be selected from known GPCR agonists or antagonists, peptide libraries, or 

10 combinatorial chemical libraries. 

Methods for detecting interactions of GCREC with intracellular signal transduction 
molecules such as G proteins are based on the premise that internal segments or cytoplasmic domains 
from an orphan G protein-coupled seven transmembrane receptor may be exchanged with the 
analogous domains of a known G protein-coupled seven transmembrane receptor and used to identify 

15 the G-proteins and downstream signaling pathways activated by the orphan receptor domains 

(Kobilka, B.K. et al. (1988) Science 240:1310-1316). In an analogous fashion, domains of the orphan 
receptor may be cloned as a portion of a fusion protein and used in binding assays to demonstrate 
interactions with specific G proteins. Studies have shown that the third intracellular loop of G 
protein-coupled seven transmembrane receptors is important for G protein interaction and signal 

20 transduction (Conklin, B R. et al. (1993) Cell 73:631-641). For example, the DNA fragment 

corresponding to the third intracellular loop of GCREC may be amplified by the polymerase chain 
reaction (PCR) and subcloned into a fusion vector such as pGEX (Pharmacia Biotech). The construct 
is transformed into an appropriate bacterial host, induced, and the fusion protein is purified from the 
cell lysate by glutathione-Sepharose 4B (Pharmacia Biotech) affinity chromatography. 

25 For in vitro binding assays, cell extracts containing G proteins are prepared by extraction 

with 50 mM Tris, pH 7.8, 1 mM EGTA, 5 mM MgCl 2 , 20 mM CHAPS, 20% glycerol, 10 |ig of both 
aprotinin and leupeptin, and 20 \x\ of 50 mM phenylmethylsulfonyl fluoride. The lysate is incubated 
on ice for 45 min with constant stirring, centrifiiged at 23,000 g for 15 min at 4°C, and the 
supernatant is collected. 750 |ig of cell extract is incubated with glutathione S-transferase (GST) 

30 fusion protein beads for 2 h at 4°C. The GST beads are washed five times with phosphate-buffered 
saline. Bound G subunits are detected by [ 32 P]ADP-ribosylation with pertussis or cholera toxins. The 
reactions are terminated by the addition of SDS sample buffer (4.6% (w/v) SDS, 10% (v/v) 
p-mercaptoethanol, 20% (w/v) glycerol, 95.2 mM Tris-HCl, pH 6.8, 0.01% (w/v) bromphenol blue). 
The [ 32 P]ADP-labeled proteins are separated on 10% SDS-PAGE gels, and autoradiographed. The 

35 separated proteins in these gels are transferred to nitrocellulose paper, blocked with blotto (5% nonfat 
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perchloric acid. The cAMP levels in the lysate are measured by radioimmunoassay using methods 
well-known in the art. Changes in the levels of cAMP in the lysate from cells exposed to ligand 
compared to those without ligand are proportional to the amount of GCREC present in the transfected 
cells. 

5 To measure changes in inositol phosphate levels, the cells are grown in 24-well plates 

containing lxlO 5 cells/well and incubated with inositol-free media and [ 3 H]myoinositol, 2 (iCi/well, 
for 48 hr. The culture medium is removed, and the cells washed with buffer containing 10 mM LiCl 
followed by addition of ligand. The reaction is stopped by addition of perchloric acid. Inositol 
phosphates are extracted and separated on Dowex AG1-X8 (Bio-Rad) anion exchange resin, and the 

10 total labeled inositol phosphates counted by liquid scintillation. Changes in the levels of labeled 
inositol phosphate from cells exposed to ligand compared to those without ligand are proportional to 
the amount of GCREC present in the transfected cells. 
XVIII. Identification of GCREC Ligands 

GCREC is expressed in a eukaryotic cell line such as CHO (Chinese Hamster Ovary) or HEK 

15 (Human Embryonic Kidney) 293 which have a good history of GPCR expression and which contain a 
wide range of G-proteins allowing for functional coupling of the expressed GCREC to downstream 
effectors. The transformed cells are assayed for activation of the expressed receptors in the presence 
of candidate ligands. Activity is measured by changes in intracellular second messengers, such as 
cyclic AMP or Ca 2+ . These may be measured directly using standard methods well known in the art, 

20 or by the use of reporter gene assays in which a luminescent protein (e.g. firefly luciferase or green 
fluorescent protein) is under the transcriptional control of a promoter responsive to the stimulation of 
protein kinase C by the activated receptor (Milligan, G. et al. (1996) Trends Pharmacol. Sci. 17:235- 
237). Assay technologies are available for both of these second messenger systems to allow high 
throughput readout in multi-well plate format, such as the adenylyl cyclase activation FlashPlate 

25 Assay (NEN Life Sciences Products), or fluorescent Ca 2+ indicators such as Fluo-4 AM (Molecular 
Probes) in combination with the FLIPR fluorimetric plate reading system (Molecular Devices). In 
cases where the physiologically relevant second messenger pathway is not known, GCREC may be 
coexpressed with the G-proteins G al5/16 which have been demonstrated to couple to a wide range of G- 
proteins (Offermanns, S. and M.I. Simon (1995) J. Biol. Chem. 270:15175-15180), in order to funnel 

30 the signal transduction of the GCREC through a pathway involving phospholipase C and Ca 2+ 
mobilization. Alternatively, GCREC may be expressed in engineered yeast systems which lack 
endogenous GPCRs, thus providing the advantage of a null background for GCREC activation 
screening. These yeast systems substitute a human GPCR and G a protein for the corresponding 
components of the endogenous yeast pheromone receptor pathway. Downstream signaling pathways 

35 are also modified so that the normal yeast response to the signal is converted to positive growth on 
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What is claimed is: 

1. An isolated polypeptide selected from the group consisting of: 

a) a polypeptide comprising an amino acid sequence selected from the group consisting of 
5 SEQ ID NO: 1-8, 

b) a naturally occurring polypeptide comprising an amino acid sequence at least 90% 
identical to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-8, 

c) a biologically active fragment of a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-8, and 

10 d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from 

the group consisting of SEQ ID NO: 1-8. 

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO: 1-8. 
15 3. An isolated polynucleotide encoding a polypeptide of claim 1 . 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

5. An isolated polynucleotide of claim 4 selected from the group consisting of SEQ ID 
20 NO:9-16. 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

25 7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

9. A method for producing a polypeptide of claim 1, the method comprising: 

30 a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said 

cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide 
comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide of 
claim 1, and 

b) recovering the polypeptide so expressed. 
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10 . „ isolated antibody which specify binds » a p*P** -«*« >• 
* isdated polynucleotide — d tan* *»P — of: 

5 0 fSEQ.DNO:9-.6, . v „ widccompris i„ g artnu Cl eo,* S6 c,u»cea, ta s t 90* 
M a naturally occumng polynucleotide ' sfoiDNO'9-16, 

c) apo 1 ynudeo, i dec™p«.oaH^-«; 

d ) a polynucleotide complen^ntary » a polynucleotide of b>. a„d 

10 e) an RNA equivalent of a)-d). 

• ;„,TAHMSt 60 contiguous nucleotides of a 
12. An isolated polynucleotide comprising at least gu 

polynucleotide of claim 11. 

A , atarget pol y nucleotideinasample > saidtar ge tpolynucleotid e 
13 A method for detecting a target poiynuc. 

, > A* n frU\ml\ the method comprising: 
having a sequence of a polynucleotide of Can, 

comprising a sequence complementary s ^ , „ yb . totlon 

spKita „y h ybrid to ,osa i d Btg e t po.y™^ deorfragmen , sth e re of,and 

present, the amount thereof. 

a , an.plifyingsaid.arge.polynucleofideorfragment.Kereofusu,^ 

amplification, and ^nucleotide or fragment 

30 b) detecting the presence or absence of saro P 

thereof, and, optionally, if present, the amount thereof. 



25 



• inB a polypeptide of claim 1 and a pharmaceutical* acceptable 
16. A composition comprising a poiypepu 



excipient. 
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8 dm ' mS,ennS '° ' >° - *«* ~ ,„e c„ mposition 



19. A m ethodf„ rscreeninga ndfo 
'him 1 , the melhod comprising: g0mst of a Mypeptide of 

10 •) exposing a sample comprising a , 

» *-«n g a g o„i sla c t iv iIy m thesamp l P 

15 

claim 20. * * m need of ««* treatment a composition of 

22. Amet ^forscree„i„ g aco mp oundf ore ffective„es S asan af • 
<»f claan 1, the method comprising: g ° n,St ofa Peptide 

a) ex P osin ga Sa mpl e com P ri S ing ap o] y pe p ti deofc]aim , tn 

b) Acting antagonist activity in the I pl , " 1 * ' "~ - 



25 23. A 



panen, „ need of such aeataKa , composjfon ^ ^ ^ 

b) deteciing binding of , hep0 | . , , 

ypep.,de „f c ,a,m , ,„ * test compomd ^ .^.^ ^ 
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compound .hat specific* binds «o .he polyM* of «•*» »• 

compound >ha, modu,a,es «• ° f «* " *" ' ' 

^Uforteexpressionofftetargelpolynucleonde, 

20 c) comparing the expression of the target poiy 

of .he compound and in the absence of the compound. 

28 Amethod for assessingtoxici.yofa.es. compound, said method comprising: 

hy bridizarion complex ,s formed be.wee [ oU ^^oHUm 
sampk, said large, polynucleotide coropnsmg a poljwcte 



25 



1 1 or fragment thereof; 



30 



agmem uwiwi 
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10 



15 



in 

20 31. 



a) combining the biological sample with an antibody of clai m 10 h 

30. The antibody of claim 10, wherein the antibody is: 

a) a chimeric antibody, 

b) a single chain antibody, 

c) a Fab fragment, 

d) a F(ab') 2 fragment, or 

e) a humanized antibody. 

3 ■ • A c 0mpositio „ a „ Mtjbody rf ciaim w ^ _ ^ ^ 



33. A composite of claim 3]? ^ ^ ^ ^ 

33. c -mount ol the composition of claim 
35. A method of p reparing a polyc]ona] 

>rising: specificity of the antibody of claim 



lOcomprisin, 



30 



•ft* an antibody response: ' ' " M '""" m0geni<: -*r condition, ,o 

b) isolating antibodies from said animal; and 

— „^ y i« fyi „ gapolydo „ a , 
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groupconsistingofSEQIDNO:l-8. 

36. An antibody produced by a method of claim 35. 

5 • • -cina the antibody of claim 36 and a suitable carrier. 

37. A composition comprising the antiooay o 

3,A m e,hodofma k ingamonoc 1 o„ ^^^^^^ 

10 comprising: se auence selected from 

,, na nf SEO ID NO: 1-8, or an immunogenic fragment thereoi, u 
the group consisting ot iu nv. , 

elicit an antibody response; 

ixzsxszzz — 

15 producing hybridoma cells; 

39. A monoclonal antibody product by a method of claim 38. 

40. A composition comprising the antibody of Cairn 39 and a suitable earner. 

25 expression library. 

n u • tK. a ntibodv is produced by screening a recombinant 
42. The antibody of claim 10, wherein the antibody pr 

immunoglobulin library. 

30 43 a method for detecting a polypeptide having an amino acid seonence selected from the 

■ *■ „ nf SFO ID NO- 1-8 in a sample, comprising the steps of. 
group consisting of SEQ ID NO 1 8 P ^ ^ ^ ^ 

a) incubating the antibody of claim 10 with a sample 
binding of the antibody and the polypeptide; and 

99 
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polypeptide having an amino acid sequence seler^ f u 

the sample. ' ^ ^ ^ S ™* C0 ™^8 of SEQ ID NO:l-8 in 



10 



V NO. 1-8 from a sample, the method comprising 
a) incubating the antibody of claim in with , 

4 selected from the group consisting of SEQ ID NO:l-8. 



15 



20 



25 



45. A polypeptide of claim 1, comprising the 

46. A polypeptide of claim 1, comprising the 

47. A polypeptide of claim 1 , comprising the ; 



amino acid sequence of SEQ ID NO:l. 
amino acid sequence of SEQ ID NO:2. 



• ammo acid sequence of SEQ ID NO:3. 
48- ApoJypept.de ofclaim *><m^^^^ mN ^ 

A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:5. 



50. A polypeptide of claim 1, comprising the amino acid 



51. A polypeptide of claim 1, comprising the ; 



sequence of SEQ ID NO:6. 



amino acid sequence of SEQ ID NO:7. 

52. A polypeptide of claim 1, comprising the amino acid 

53. A polynucleotide of claim 1 1 



sequence of SEQ ID NO:8. 



COmpnS,ng the P 0 'y™cleotid e sequence of SEQ ID NO:9. 



30 NO: 10. 



54. A polynucleotide of claim 



11, comprising the polynucleotide 



sequence of SEQ ID 
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56. A polynucleotide of claim 1 1. comprising 



the polynucleotide sequence of SEQ ID 



NO:12. 



57. A polynucleotide of claim 11, comprising 



the polynucleotide sequence of SEQ ID 



NO:13. 



58. A polynucleotide of claim 11, comprising 



the polynucleotide sequence of SEQ ID 



NO:14. 



59. A polynucleotide of claim 11, comprising 



the polynucleotide sequence of SEQ ID 



N0:15. 



60 . A p*,— of Cain H,co m pri si n g U,epoly- e o t *se q „ M c e ofSEQip 



N0.16. 
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<141> Herewith 
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<160> 16 2000-06-02; 2000-06-02 

<170> PERL Program 

<210> 1 

<211> 372 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> mcyte ID No: 7474872CD1 

<400> 1 

- - ila As „ ser ser Thr Am sw 

r * * - -a * - - - »" - « « » 

:;; lleuAia M— — s 3 

Val Phe Leu Arcr Al« t 40 fp 

«« c ys A 50 9 WS Is ™ <~ S « ™ ^ 

-« c ys Ssn leu Ma ^ 55 ^ 

Pro Val , 65 " Thr Leu Ser Leu 

Val Arg Leu Ser Tyr Ty r A la Leu Has His t 75 

Asp Leu Leu Cys G !n , h , ** ^ Pr ° Phe ^ 

^ Gin Thr Thr Gly Ala He Ph« ni 90 

Tvr Glv q 95 ion Gln Met Asn Met 

Tyr Gly Ser cys lie Phe Leu Met Leu ^ 

Ala Ala He Val £2 p ^ ' ^ ^ 

Val f ls p ro Leu Arg Leu Ara w-ic T 120 

Arg Val Ala Arg Leu r ^ 

5 ^ ^ ^ °* ^ *P Ala Leu lie ^ 
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140 



Val 



Phe Ala Val Pro Ma Ala Arg 



155 



Arg Tyr 



Arg Asp Leu Glu Val Arg 



170 



Asp 



Glu Leu Trp Lys Gly Arg Leu 



185 



Glu Ala Leu Gly Phe Leu Leu Pro 



Ser 



Gly Arg Val Phe Trp Thr Leu 



215 



Ser Gin Arg 



Arg Arg Lys Thr Val 



230 



Leu Cys Phe Val 
245 



Val He Phe Leu 

val Tyr Gly Leu Leu Arg Ser Lys 
Ala Arg Asp Arg Val Arg Gly Val 



275 



Ala 



Gly Ala Asn Cys Val Leu Asp 



290 



Ala Glu Gly Phe Arg Asn Thr Leu 



Arg 



Ala Arg Thr Ser Ala Thr Asn 



320 



Gin Ser 



Glu Arg Ser Ala Val Thr 



335 



Ala 



Ala Ser Gin Gly Leu Leu Arg 



350 



Ser Ser 



Phe Thr Gin Cys Pro Gin 



365 



145 
Val His 
160 
Leu Cys 
175 
Leu Pro 
190 
Leu Ala 
205 
Ala Arg 
220 
Arg Leu 
235 
pro Tyr 
250 
Leu Val 
265 
Leu Met 
280 
Pro Leu 
295 
Arg Gly 
310 
Gly Thr 
325 
Thr Asp 
340 
pro Ser 
355 
Asp Ser 
370 



Arg Pro Ser 
Phe Glu Ser 
Leu Val Leu 
Ala Val Val 
Pro Asp Ala 
Leu Leu Ala 
Asn Ser Thr 
Ala Ala Ser 
Val Met Val 
Val Tyr Tyr 
Leu Gly Thr 
Arg Ala Ala 
Ala Thr Arg 
Asp Ser His 
Ala Leu 



150 
Arg cys 
165 
Phe Ser 
180 
Leu Ala 
195 
Tyr Ser 
210 
Thr Gin 
225 
Asn Leu 
240 
Leu Ala 
255 
Val Pro 
270 
Leu Leu 
285 
Phe Ser 
300 
Pro His 
315 
Leu Ala 
330 
Pro Asp 
345 
Ser Leu 
360 



<210> 2 

<211> 337 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> iuisc_feature 
<223> incyte ID No: 



<400> 2 
Met Asn 



6575963CD1 



Glu Pro Leu Asp Tyr Leu 

^ ^ Ala Ala Ala Phe Gly Asn 

Leu Lys Met His Tyr Leu Pro Val 

Val Gly Phe Pro Gly Asn Ala Val 
50 



Lys 



Met Arg Pro Trp Lys Ser Ser 



65 



Ala Cys 



Thr Asp Leu Leu Tyr Leu 



80 



His 



^ Tyr Ala Ser Gly Glu Asn 



95 



Phe He Arg Phe Ser Phe 



110 



Phe Leu Thr Cys Phe Ser 



Cys Lys 
lie Leu 

lie His Pro Met Ser Cys Phe Ser 

140 , m 

Val Val Ala Cys Ala Val Val Trp 



125 



Ala Asn 
10 

Cys Thr 
25 

lie Tyr 
40 

Val He 
55 

Thr He 
70 

Thr Ser 
85 

Trp He 
100 
His Phe 
115 
He Phe 
130 
He His 
145 
He He 



Ala Ser Asp 
Asp Glu Asn 
Gly He He 
Ser Thr Tyr 
He Met Leu 
Leu Pro Phe 
Phe Gly Asp 
Asn Leu Tyr 
Arg Tyr Cys 
Lys Thr Arg 
Ser Leu Val 



Phe Pro 
15 

He Pro 
30 

Phe Leu 
45 

He Phe 
60 

Asn Leu 
75 

Leu He 
90 

Phe Met 
105 
Ser Ser 
120 

Val He 
135 
Cys Ala 
150 
Ala Val 
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155 










160 










165 


lie 


Pro 


Met 


Thr 


Phe 


Leu 


He 


Thr 


Ser 


Thr 


Asn 


Arg 


Thr 


Asn 


Ara 










170 










175 










180 


Ser 


Ala 


Cys 


Leu 


Asp 


Leu 


Thr 


Ser 


Ser Asp 


Glu 


Leu 


Asn 


Thr 


He 










185 










190 










195 


Lys 


Tro 


Tyr 


Asn 


Leu 


He 


Leu 


Thr 


Ala 


Thr 


Thr 


Phe 


Cvs 


Leu 


Pro 










200 










205 










210 


Leu 


Val 


He 


Val 


Thr 


Leu 


Cys 


Tvr 


Thr 


Thr 


He 


He 


His 


Thr 


Leu 










215 










220 










225 


Thr 


His 


Gly Leu 


Gin 


Thr 


Asp 


Ser 


Cys 


Leu 


Lys 


Gin 


Lvs 


Ala 


Arg - 










230 










235 










240 


Arg 


Leu 


Thr 


He 


Leu 


Leu 


Leu 


Leu 


Ala 


Phe 


Tvr 


Val 


Cvs 


Phe 


Leu 










245 










250 










255 


Pro 


Phe 


His 


He 


Leu 


Arg 


Val 


He 


Arg 


He 


Glu 


Ser 


Arg 


Leu 


Leu 










260 










265 










270 


Ser 


lie 


Ser 


Cys 


Ser 


He 


Glu 


Asn 


Gin 


He 


His 


Glu 


Ala 


Tvr 


He 










275 










280 










285 


Val 


Ser 


Arg 


Pro 


Leu 


Ala 


Ala 


Leu 


Asn 


Thr 


Phe 


Glv 


Asn 


Leu 


Leu 










290 










295 










300 


Leu 


Tvr 


Val 


Val 


Val 


Ser 


Asp 


Asn 


Phe 


Gin 


Gin 


Ala 


Val 


Cys 


Ser 










305 










310 










— > j — > 


Thr 


Val 


Arg 


Cys 


Lys 


Val 


Ser 


Gly 


Asn 


Leu 


Glu 


Gin 


Ala 


Lys 


Lys 










320 










325 










330 


He 


Ser 


Tyr 


Ser 


Asn 


Asn 


Pro 


























335 























<210> 3 

<211> 346 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 7474846CD1 



<400> 3 
























Met Tyr 


Asn 


Gly 


Ser 


Cys 


Cys 


Arg 


He Glu 


Gly Asp 


Thr 


He 


Ser 


1 






5 








10 








15 


Gin Val 


Met 


Pro 


Pro 


Leu 


Leu 


He 


Val Ala 


Phe Val 


Leu 


Gly 


Ala 








20 








25 








30 


Leu Gly 


Asn 


Gly 


Val 


Ala 


Leu 


Cys 


Gly Phe 


Cys Phe 


His 


Met 


Lys 








35 








40 








45 


Thr Trp 


Lys 


Pro 


Ser 


Thr 


Val 


Tyr 


Leu Phe 


Asn Leu 


Ala 


Val 


Ala 








50 








55 








60 


Asp Phe 


Leu 


Leu 


Met 


lie 


Cys 


Leu 


Pro Phe 


Arg Thr 


Asp 


Tyr 


Tyr 








65 








70 








75 


Leu Arg 


Arg 


Arg 


His 


Trp 


Ala 


Phe 


Gly Asp 


He Pro 


Cys 


Arg 


Val 








80 








85 








90 


Gly Leu 


Phe 


Thr 


Leu 


Ala 


Met 


Asn 


Arg Ala 


Gly Ser 


lie 


Val 


Phe 








95 








100 








105 


Leu Thr 


Val 


Val 


Ala 


Ala 


Asp 


Arg 


Tyr Phe 


Lys Val 


Val 


His 


Pro 








110 








115 








120 


His His 


Ala 


Val 


Asn 


Thr 


He 


Ser 


Thr Arg 


Val Ala 


Ala 


Gly 


He 








125 








130 








135 


Val Cys 


Thr 


Leu 


Trp 


Ala 


Leu 


Val 


He Leu 


Gly Thr 


Val 


Tyr 


Leu 








140 








145 








150 


Leu Leu 


Glu 


Asn 


His 


Leu 


Cys 


Val 


Gin Glu 


Thr Ala 


Val 


Ser 


Cys 








155 








160 








165 


Glu Ser 


Phe 


He 


Met 


Glu 


Ser 


Ala 


Asn Gly 


Trp His 


Asp 


lie 


Met 








170 








175 








180 


Phe Gin 


Leu 


Glu 


Phe 


Phe 


Met 


Pro 


Leu Gly 


He He 


Leu 


Phe 


Cys 








185 








190 








195 


Ser Phe 


Lys 


He 


Val 


Trp 


Ser 


Leu 


Arg Arg 


Arg Gin 


Gin 


Leu 


Ala 
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Arg Gin Ala 
Ala He Val 
Leu Tyr Phe 
Val His Gly 
Ser Met Leu 
Pro Lys Phe 
Gin Pro Gly 
Ser Asn Leu 
Gin Ser Gin 
His 



200 
Arg Met 
215 
Phe He 
230 
Leu Trp 
245 
Ala Leu 
260 
Asp Pro 
275 
Tyr Asn 
290 
His Ser 
305 
Gly Arg 
320 
Ser Asp 
335 



Lys Lys 
Thr Cys 
Thr Val 
His He 
Leu Val 
Lys Leu 
Lys Thr 
Arg Ser 
Gly Gin 



Ala Thr 
Tyr Leu 
Pro Ser 
Thr Leu 
Tyr Tyr 
Lys He 
Gin Arg 
Cys He 
Trp Asp 



205 

Arg Phe 
220 

Pro Ser 
235 

Ser Ala 
250 

Ser Phe 
265 

Phe Ser 
280 

Cys Ser 
295 

pro Glu 
310 

Ser Val 
325 

Pro His 
340 



He Met 
Val Ser 
Cys Asp 
Thr Tyr 
Ser Pro 
Leu Lys 
Glu Met 
Ala Asn 
He Val 



210 
Val Val 
225 
Ala Arg 
240 
Pro Ser 
255 
Met Asn 
270 
Ser Phe 
285 
Pro Lys 
300 
pro He 
315 
Ser Phe 
330 
Glu Trp 
345 



<210> 4 
<2H> 432 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc^feature 
<223> incyte ID No: 



1258785CD1 



<400> 4 

Met Glu Glu Arg Ala 

1 5 
Ala Ala Gly Ala Leu 

Cys Val Arg Pro Pro 

His Gin Asp Gly Lys 
50 

Thr Lys He Lys Asp 
Thr Ala Asp Pro His 
He Ala Leu Leu Tyr 
Thr Tyr Leu Leu Arg 

Asn Leu Asn Gly Arg 

12b 

Pro Leu Ala Val Gly 
Cys Glu Ser Gin Glu 
Ser Val Val Cys Gin 
Gly Arg Ala Gly Tyr 
He Gly Pro Gly Thr 
Asn Ala He lie Glu 
Lys Thr Glu Arg Cys 



phe Val 
Leu Ala 
Ala Thr 
lie He 
Leu Leu 
Asp Cys 
Leu Gin 
Ser Leu 
Arg Val 
Ala Val 
Cys Val 
Glu Ser 
Leu Tyr 
Val Cys 
Ser Gly 
Pro Leu 



Asn Pro 
Ser Gly 
Thr Asp 
His Asn 
Gin Gin 
Ser Ala 
Leu Tyr 
Asp Tyr 
Thr Phe 
He Tyr 
Thr Lys 
Asp Leu 
Ala Leu 
Glu Ser 
Lys Thr 
Leu Tyr 



Phe Pro 
10 

Ala Ala 

25 
Glu Pro 
40 

Phe He 
55 

Met Glu 

70 
Tyr Thr 
85 

Arg Val 
100 

Val Lys 
115 

Leu Cys 
130 

His Lys 
145 

Leu Leu 
160 

pro Asp 
175 

Leu Tyr 
190 

Ala He 
205 

Leu Ser 
220 

Gin Trp 



Asp Tyr 
Glu Glu 
Gly Leu 
Arg Arg 
Glu Gly 
Gly Trp 
Thr Cys 
Arg Thr 
Gly Asp 
Leu Arg 
Gin Leu 
Glu Leu 
Leu Asn 
Lys Glu 
Arg Glu 
His Arg 



Glu Ala 
15 

Thr Gly 
30 

Pro Phe 
45 

He Gin 

60 

Leu Lys 
75 

Thr Gly 
90 

Asp Gin 
105 
Leu Arg 
120 
Ala Gly 
135 
Ser Asp 
150 
Gin Arg 
165 
Leu Tyr 
180 
Thr Glu 
195 
Val Val 
210 
Glu Arg 
225 
Lys Gin 
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230 










Tyr 


Val 


Gly Ala 


Ala 


His 


Gly 


Met 


Ala 










245 










Met 


Gin 


Pro 


Ala 


Ala 
260 


Lys 


Val 


Asp 


Gin 


Val 


Lys 


Pro 


Ser 


He 
275 


Asp 


Tyr 


Val 


Arg 


Gly Asn 


Tyr 


Pro 


Ser 


Ser 


Leu 


Ser 


Asn 










290 










His 


Trp 


Cys 


His 


Gly 


Ala 


Pro 


Gly Val 










305 










Ala 


Tyr 


Lys 


Val 


Phe 
320 


Lys 


Glu 


Glu 


Lys 


Glu 


Cys 


Ser 


Asp 


Val 
335 


He 


Trp 


Gin 


Arg 


Tyr 


Gly 


He 


Cys 


His 


Gly 


Thr 


Ala Gly 










350 










Ser 


Leu 


Tyr 


Arg 


Leu 
365 


Thr 


Gin 


Asp 


Lys 


Cys 


Lys 


Phe 


Ala 


Glu 
380 


Trp 


Cys 


Leu 


Asp 


Arg 


He 


Pro 


Asp 


Arg 
395 


Pro 


Tyr 


Ser 


Leu 


Ala 


He 


His 


Phe 


Leu 
410 


Ser 


Asp 


Val 


Leu 


Phe 


Pro 


Ala 


Phe 


Glu 
425 


Leu 


Asp 


Ser 


Ser 



<210> 5 
<211> 240 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 1874944CD1 

<400> 5 



Met Pro 


Val 


Leu 


Leu 


His 


Tyr 


Phe 


Phe 


1 






5 










Met Leu 


Val 


Glu 


Gly 
20 


Leu 


His 


Leu 


Tyr 


Phe Gly 


Ser 


Glu 


Asp 
35 


Ser 


Lys 


His 


Arg 


Trp Gly 


Phe 


Pro 


Leu 
50 


Leu 


He 


Cys 


He 


Met Asp 


Ser 


Tyr 


Gly 
65 


Thr 


Ser 


Asn 


Asn 


Ser Gly 


Ala 


He 


Trp 
80 


Ala 


Phe 


Val 


Ala 


Val Val 


Asn 


He 


Gly 
95 


He 


Leu 


He 


Ala 


Gin He 


Ser 


Ala 


Asp 
110 


Asn 


Tyr 


Lys 


He 


Phe Lys 


Leu 


Thr 


Ala 
125 


Lys 


Ala 


Val 


Ala 


Gly Thr 


Ser 


Trp 


Val 
140 


Phe 


Gly 


Val 


Leu 


Val Val 


Phe 


Gin 


Tyr 
155 


Met 


Phe 


Ala 


Thr 


Leu Phe 


lie 


Phe 


Leu 
170 


Phe 


His 


Cys 


Leu 


Ala Ala 


Phe 


Lys 


His 


Lys 


Thr 


Lys 


Val 
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235 










240 


Gly 


He 


Tyr 


Tyr 


Met 


Leu 


250 










255 


Glu 


Thr 


Leu 


Thr 


Glu 


Met 


265 










270 


His 


Lys 


Lys 


Phe 


Arg 


Ser 


280 










285 


Glu 


Thr 


Asp 


Arg 


Leu 


Val 


295 










300 


He 


His 


Met 


Leu 


Met 


Gin 


310 










315 


Tyr 


Leu 


Lys 


Glu 


Ala 


Met 


325 










330 


Gly 


Leu 


Leu 


Arg 


Lys 


Gly 


340 










345 


Asn 


Gly Tyr 


Ser 


Phe 


Leu 


355 










360 


Lys 


Tyr 


Leu 


Tyr 


Arg 


Ala 


370 










375 


Tyr 


Gly Ala 


His 


Gly 


Cys 


385 










390 


Phe 


Glu 


Gly Met 


Ala 


Gly 


400 










405 


Gly 


Pro 


Glu 


Thr 


Ser 


Arg 


415 










420 


Lys 


Arg 


Asp 








430 












Leu 


Ser 


Ala 


Phe 


Ala 


Trp 


10 










15 


Ser 


Met 


Val 


He 


Lys 


Val 


25 










30 


Tyr 


Tyr Tyr Gly Met 


Gly 


40 










45 


He 


Ser 


Leu 


Ser 


Phe 


Ala 


55 










60 


Cys 


Trp 


Leu 


Ser 


Leu 


Ala 


70 










75 


Pro 


Ala 


Leu 


Phe 


Val 


He 


85 










90 


Val 


Thr 


Arg 


Val 


He 


Ser 


100 










105 


His 


Gly Asp 


Pro 


Ser 


Ala 


115 










120 


Val 


Leu 


Leu 


Pro 


He 


Leu 


130 










135 


Ala 


Val 


Asn 


Gly Cys 


Ala 


145 










150 


Leu 


Asn 


Ser 


Leu 


Gin 


Gly 


160 










165 


Leu 


Asn 


Ser 


Glu 


Val 


Arg 


175 










180 


Trp 


Ser 


Leu 


Thr 


Ser 


Ser 
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195 

190 



^ S Asn Ala Lys Pro Phe His Ser Asp Leu Met 
ser Ala Arg Thr Ser 5 ^ ^ 

As n Gly Thr Arg Pro Gly Met Ala Ser Thr Ly 5 

S er His Ser Ala His Arg Val Asp Leu Ser Ala Val 
Asp Lys Ser Ser His ser ^ 



<210> 6 
<2H> 271 
<212> PRT 

<213> Homo sapiens 



<220> 



< 100 > 6 T „ val Asn Gin ser Tyr Thr WP Gly Phe «» ™ 
Met Gly Ms Trp v.1 10 

, 1 G1 y lie Phe si His ser Gin Thr Asp Leu val ...» 3Q 
Leu Gly lie wi 2 5 x Leu 

w i Phe Thr Val Ala Leu Cys Gly Asn 

Ala Val Me, Val P ^ „ _ ^ pro 

Leu Xle Phe Leu lie *r ^ ^ _ 

Tyr Phe Phe Leu Ser Gin Leu Ser ^ 

, Ile Val Pro Lys Met Ala Ala Asn Phe Leu Ser Gly A g g 

Cys Asn He Val Pr 85 phe phe 

Tlp ser phe Val Gly cys Gly lie Gin 1Q5 
LYS Ser He Ser Ftie 1QQ ^ 

„ al At ser Glu Gly Leu Leu Leu Gly Leu 
Val Ser Leu Val Gly 115 

* „ Ara Tyr Val Ala Val Ser His Pro Leu His Tyr ^ 
Tyr Asp Arg Tyr 130 ger Ser Trp 

tr=T pv*? Leu Gin lie it"- ^-*-Jf icq 
^^Treirpl vaine^MetValAlaAlaMet 
Ala Phe Gly He lie Asp y > phe Phe Trp 

Gly L eu Pro Tyr Cys Gly Ser Arg Ser Val ^ 
G1Y 11 170 , Trp Lys Lys Ala Leu 

!rn T-i- Ar g Ser Ala Gin Ala irp y ig5 
Ma Val Leu Arg lie Arg 19Q 

I 85 v. *i « val Thr Leu Phe Tyr Gly 

„ cot- c er His Leu Thr Ala Val inr 210 
AXa Thr Cys Ser Ser ^ ^ 

r;r::sr.::.::|™~— 1 

Gly A la Leu Arg Lys Gly Leu Asp Arg Cys Arg 



260 

His 



<210> 7 
<2H> 276 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> misc_f eature 

<223> Incyte ID No: 55000189CD1 

<400> 7 



1*1 e L- 


A T~rr 
ax y 


rVI y 




A c; n 


utr Li. 


Thr* 


wlLl 


vai 


1 111 


ulU 


xr lie 


Va 1 
Val 


irfie 


Leu 


1 

1 








-J 










1 u 










1 D 






Coy 


mi y 


XT 11C 


n i o 




nib 


ni o 


Tl o 

lie 


1 ill 


Leu 


irii.e 


vai 


u 3 1 

v ai 






























■3 U 




ue u. 


lie 


ue u 


Tvrr* 
iyx 


i i n 


JjCU 


TVit 
i in 


Va 1 
vai 


Al a 
riXa 


uiy 


Asn 


Ala 
Ala 


Tl o 
lie 


T 1 a 

ne 




















f± U 












11 e L. 


Thr 

X Hi 


Tip 
lie 


Tie 

lie 




lie 


A en 


A VfT 

ril y 


n i o 


Leu 


nib 


i nr 


tr I O 


Mot- 
l*le U 


lyr 










50 










J -J 










o u 




PVl£=k 

Jtrile 


Leu 


Del 


1*1 c u 


Leu 


A 1 a 




oei 


Lys 


Tnr 


vai 


Tyr 


i nr 


Leu 










U -J 










/ u 










/ D 


rile 


x xe 


i i e 


Pr O 


r*i n 




Leu 


Ser 


Cqv- 

oci 


rne 


vai 


Thr 


Gin 


inr 


Vain 










o u 




















q n 
y u 


Pro 


i le 


Seir 


Leu 


Ala 
Ala 


oiy 


Cys 


Tnr 


j. nr 


bin 


i nr 


Phe 


Phe 


pne 


vai 




















100 










1U j 


i nr 


Leu 


Ala 


Tl <=> 

ile 


Asn 


Asn. 


Cys 


irne 


Leu 


Leu 


Tnr 


Val 


Met 


biy 


Tyr 










11 u 










115 










ion 


Asp 


HIS 


Tyr 


wet 


Ala 


Tl Q 

ne 


Cys 


Asn 


Pro 


Leu 


Arg 


Tyr 


Arg 


Val 


lie 










1 Z _> 










130 










li D 


Thr 


Ser 


Lys 


Lys 


vai 


Cys 


vai 


win 


Leu 


Val 


Cys 


Gly Ala 


Pne 


Ser 










14t u 










145 










i c n 

IdO 


Tl- 


pi,, 
tjiy 


Leu 


Aia 


jxiec 


Aia 


Ala 


vai 


Gin 


Val 


Tnr 


Ser 


He 


Phe 


Thr 










1 cc 
J. J J 










160 










lOD 


Leu 


Pro 


Frie 


Cys 


nlS 


inr 


vai 


vai 


Gly His 


Pne 


Phe 


Cys 


Asp 


He 










1 / U 










175 










1 0 0 


Leu 


PJTO 


vai 


Met 


Lys 


Leu 


Ser 


Cys 


He 


Asn 


Thr 


Thr 


He 


Asn 


Glu 




















190 










195 


lie 


lie 


Asn 


Pne 


vai 


vai 


Arg 


Leu 


Phe 


Val 


lie 


Leu 


Val 


Pro 


Met 










2 00 










205 










210 


Gly 


Leu 


Val 


Phe 


He 


Ser 


Tyr 


Val 


Leu 


He 


He 


Ser 


Thr 


Val 


Leu 










215 










220 










225 


Lys 


He 


Ala 


Ser 


Ala 


Glu 


Gly 


Trp 


Lys 


Lys 


Thr 


Phe 


Ala 


Thr 


Cys 










230 










235 










240 


Ala 


Phe 


His 


Leu 


Thr 


Val 


Val 


He 


Val 


His 


Tyr 


Gly Cys 


Ala 


Ser 










245 










250 










255 


He 


Ala 


Tyr 


Leu 


Met 


Pro 


Lys 


Ser 


Glu 


Asn 


Ser 


He 


Glu 


Gin 


Asp 










260 










265 










270 


Leu 


Leu 


Leu 


Ser 


Val 


Thr 





















275 

<210> 8 
<211> 408 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 7474839CD1 

<400> 8 



Met 


Lys 


Lys 


Arg 


Glu 


Ser 


Leu 


Thr 


Gin 


Leu 


Arg 


Ser 


Pro 


Trp 


Val 


1 








5 










10 










15 


Val 


Ser 


Val 


Phe 


Gly 


Ala 


Leu 


He 


Thr 


Val 


Ala 


Arg 


Phe 


Leu 


Asp 










20 










25 










30 


Leu 


Val 


Pro 


Thr 


Gin 


Arg 


Asn 


Phe 


Phe 


Lys 


Pro 


Val 


Arg 


Pro 


Val 










35 










40 










45 


Pro 


Ser 


Phe 


Ala 


Tyr 


Pro 


Leu 


Ser 


Gin 


Asp 


Arg 


Thr 


Pro 


Gin 


Phe 










50 










55 










60 


Leu 


Pro 


Pro 


Thr 


Leu 


His 


Leu 


Ser 


Lys 


Ala 


Arg 


Gly 


He 


Thr 


Leu 
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Pro si* £ «• «y - <*• »i - - Gln | 

Le „ vai p™ xxe xi - *r « «. ox, ^ v»x f a 
61a ^ r Phe t .u £ vaX «y XXe Pro «y » «y W - II. 

Bis Ph « w i£ - » - - s; - - Ala & 

12 5 - T , Ara V al Glu Arg 

Th r Leu Gly Asn Leu Thr He Val Leu lie He Arg 

140 ^ Toll Met Leu Ser Thr 

Arg Leu His Glu Pro Met Tyr Leu Phe Leu Ala ^ 

155 „ ti. Thv Met Pro Lys Met Ala Ser 
lie Asp Leu Val Leu Ser Ser He Thr Met y 18Q 

v t Met Gil lie Gin Glu lie Glu Phe Asn He Cys Leu 
Leu Phe Leu Met Gly lie *x 19Q 19s 

185 • T£ ,» <?Pr Ala Val Glu Ser Ala 

Ala Gin Met Phe Leu lie His Ala Leu Ser 

200 , «v,o val Ala He Cys His 

Val Leu Leu Ala Met Ala Phe Asp Arg Phe Val ^ 

215 „ i T «, Thr Gly Cys Thr Val Ala Lys 
Pro Leu Arg His Ala Ser Val Leu Thr Gly cy ^ 

230 „*. *™ rlv Phe Val Phe Phe Phe Pro 
lie Gly Leu Ser Ala Leu Thr Arg Gly Phe 

245 „. t •„ q er Tvr Cys Gin Thr His Thr 
Leu Pro Phe lie Leu Lys Trp Leu Ser Tyr cy 27Q 

. c III cvs Leu His Gin Asp He Met Lys Leu Ser 

Val Thr His Ser Phe Cys Leu n ^ 2 g5 

275 „ i Tr=,i Tvr Glv Leu Phe He He 

cys Thr Asp Thr Arg Val Asn Val Val Tyr Gly 3Q0 

290 e r m , Phe He Gly Phe Ser Tyr 

Leu Ser Val Met Gly Val Asp Ser Leu Phe He 

305 — -» T „. cor Ser Arg Arg Ala 

Ile Leu He Leu Trp Ala Val Leu Glu Leu Ser ^ 

320 Tla ql r His Leu Cys Ala Val 

Ala Leu Lys Ala Phe Asn Thr Cys He Ser His Cy ^ 

33 ^ „ t«, He Gly Leu Ser Val Val His Arg 
Leu Val Phe Tyr Val Pro Leu He Gly l^ 360 

,eu Pro Z « ~ «. ^ -X Val Met ,X, Asn £ 

^ „, Leu ^ HI P" VaX V.l Pro Val W «* *la 
ws Thr Ota ||| - « ^ S ^ ^ ^ 

Gly Gly Lys 



<210> 9 

<2H> 2444 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature , 
<223> incyte ID No: 7474872CB1 



<400> 9 v ^trccc tcccgccccg ggagcaggtc ctaccagccc 60 

cgtgagatct gctgaggtgg ^^cc gcggaagcca gcttggggca gcgcagagca 120 
agcccagccc agcccagccc a^aggca aa acgcaccc aacccttggc 180 

acacggagca caggtctctg ctgctgatga * *J ctccctctgc ttgttttggg 240 
agccatctgt ccctgcagcc atagccca cc aatg ccatgc catgattctg 300 

accatgtctg tacagcctct aggccc^ a 
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gtgtgctcca 
cagctcctca 
gcacttggtg 
ctgggtcttc 
ggcggccagc 
gcaccactgg 
catgtacggc 
gcacccgctg 
cgtgtgggcg 
ttgccgctac 
gaaaggcagg 
ggcggcggtg 
gcagagccag 
gctgtgcttc 
ggtggcggcc 
gctggccggc 
ccgcaacacc 
gacgcgggcg 
ggatgccgcc 
acagtgtccc 
tcccaacgcc 
acttggacct 
ccagggccca 
gcaccccttc 
aggacgaaat 
agaaaatggg 
tttggtggac 
atgccacccc 
gagttcctat 
ggcatccagc 
acatccaagt 
cctcctccat 
tctctcctct 
gtaaggataa 
actcttagaa 
catactagaa 



tggcatcccc 
accaacagtt 
gtctacagct 
ctgcgcgcgc 
gacctgctct 
cccttccccg 
agctgcatct 
cgactgcgcc 
ctcatcctgg 
cgggacctcg 
ctgctgcccc 
gtctactcgt 
cggcggcgga 
gtgccctaca 
agcgtgcctg 
gccaactgcg 
ctgcgcggcc 
gcgctcgcgc 
agtcaggggc 
caggattccg 
tctcgttctg 
ttgggtggca 
gggaaggcag 
tccagcggtg 
gcctttgggt 
cattctggat 
aaagctactg 
ctttcatttc 
ggacagaact 
atgggatagt 
ccaaaccctt 
tttaggtcct 
cctctctctt 
aattctttct 
attgaggatc 
attaaagcga 



agcctagctc 
ctghtctccc 
tggtgctggc 
tgcgcgtgca 
tcaccctctc 
acctcctgtg 
tcctgatgct 
acctgcggcg 
tgtttgccgt 
aggtgcgcct 
tcgtgctgct 
cgggccgagt 
agaccgtgcg 
acagcacgct 
cccgcgatcg 
tgctggaccc 
tgggcactcc 
aatccgaaag 
tgctccgacc 
ccctctgaac 
ggaggcttac 
attccagctt 
tgctgctgga 
ggaagtgatg 
gggcagggca 
ctaaaccgcc 
agatgcctgt 
ggcctaggct 
acattctttc 
tggggcaagg 
tttaggttat 
taatattctt 
ctcctcttct 
actaaagcac 
tcaaagagct 
atacattttt 



ccaatcccac 
gtgtcctgac 
tgccgggctc 
ctcggtggtg 
gctgcccgtt 
ccagacgacg 
catcaacgtg 
gccccgcgtg 
gcccgccgcc 
atgcttcgag 
ggccgaggcg 
cttctggacg 
cctcctgctg 
ggcggtctac 
cgtgcgcggg 
gctggtgtac 
gcaccgggcc 
gtccgccgtc 
ctccgactcc 
acacatgcca 
agggtgtaca 
agcaacgcag 
aatggcttct 
cagagagccc 
ttaaactgct 
acaggggcct 
tcatctgctg 
tcccctgctc 
tcgcatagtg 
caaaactaac 
cctttcttcc 
tctttttctc 
ctctctctcc 
tggttctcaa 
ttgcttatat 
aaaataaaaa 



tttggcacga 
taccgaccta 
cccctcaacg 
agcgtgtaca 
cgtctctcct 
ggcgccatct 
gaccgctacg 
gcgcggctgc 
cgcgtgcaca 
agcttcagcg 
ctgggcttcc 
ctggcgcgcc 
gctaacctcg 
gggctgctgc 
gtgctgatgg 
tactttagcg 
aggacctcgg 
accaccgacg 
cactctctgt 
ttgcgctgtc 
cacaagaagg 
aagagtacaa 
ttaaactgtg 
acccgtgcag 
aaaagctggt 
gagagctgaa 
acttctgtct 
accactgagg 
acttgtgaca 
ttagagtttc 
atcacatccc 
tctctctcgt 
cgctctctcc 
actttttggt 
tttgttcttt 
aaaa 



tgttagccaa 
cccaccgcct 
cgctagccct 
tgtgtaacct 
actacgcact 
tccagatgaa 
ccgccatcgt 
tctgcctggg 
ggccctcgcg 
acgagctgtg 
tgctgcccct 
ccgacgccac 
tcatcttcct 
ggagcaagct 
tgatggtgct 
ccgagggctt 
ccaccaacgg 
ccaccaggcc 
cttccttcac 
cgtgcccgac 
tgggctgggc 
agtgtggaag 
agcacgcaga 
agggcagaag 
tagatggaac 
gagcaccagg 
aggctcatgg 
cctaatacaa 
atttagactt 
cccctcaaca 
cttttccagg 
ttctctcttc 
tttgtccaga 
ctcagacccc 
tgatacttac 



360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2444 



<210> 10 

<211> 1014 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 6575963CB1 



<400> 10 

atgaatgagc 

tttggaaatt 

ggcattatct 

aaaatgagac 

ctgtatctga 

tttggagatt 

atcctcttcc 

tgcttttcca 

tcactggtag 

tcagcctgtc 

attttgactg 

attatccaca 

aggctaacca 

agggtcattc 

catgaagctt 

ctatatgtgg 



cactagacta 
gcactgatga 
tcctcgtggg 
cttggaagag 
ccagcctccc 
tcatgtgtaa 
tcacctgttt 
ttcacaaaac 
ctgtcattcc 
tcgacctcac 
caactacttt 
ctctgaccca 
ttctgctact 
ggatcgaatc 
acatcgtttc 
tggtcagcga 



tttagcaaat 
aaacatccca 
atttccaggc 
cagcaccatc 
cttcctgatt 
gtttatccgc 
cagcatcttc 
tcgatgtgca 
gatgaccttc 
cagttcggat 
ctgcctcccc 
tggactgcaa 
ccttgcattt 
tcgcctgctt 
tagaccatta 
caactttcag 



gcttctgatt 
ctcaagatgc 
aatgcagtag 
attatgctga 
cactactatg 
ttcagcttcc 
cgctactgtg 
gttgtagcct 
ttgatcacat 
gaactcaata 
ttggtgatag 
actgacagct 
tacgtatgtt 
tcaatcagtt 
gctgctctga 
caggctgtct 



tccccgatta 
actacctccc 
tgatatccac 
acctggcctg 
ccagtggcga 
atttcaacct 
tgatcattca 
gtgctgtggt 
caaccaacag 
ctattaagtg 
tgacactttg 
gccttaagca 
ttttaccctt 
gttccattga 
acacctttgg 
gctcaacagt 



tgcagctgct 
tgttatttat 
ttacattttc 
cacagatctg 
aaactggatc 
gtatagcagc 
cccaatgagc 
gtggatcatt 
gaccaacaga 
gtacaaccta 
ctataccacg 
gaaagcacga 
ccatatcttg 
gaatcagatc 
taacctgtta 
gagatgcaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 
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gtaagcggga 



accttgagca 



igcaaagaaa 



attagttact caaacaaccc ttga 



1014 



<210> 11 

<211> 1° 83 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> » isc ^ e S U S- 7474846CB1 
<223> incyte ID no. 



<400> 11 
ccctgggctg 
tgccgcatcg 
gtgctgggcg 
tggaagccca 
tgcctgcctt 
ccctgccgag 
acggtggtgg 
atctccaccc 
acagtgtatc 
agcttcatca 
atgcccctcg 
cagcagctgg 
attgtgttca 
gtgccctcga 
acctacatga 
aaattctaca 
acacaaaggc 
gcaaatagtt 

tga 



ctctgcaccc ggacacttgc 
agggggacac catctcccag 
caftaggcaa tgggjtcgcc 
gcactgttta ccttttcaat 
"ttcggacaga ctattacctc 
tggggctctt cacgttggcc 
ctgcggacag gtatttcaaa 
gggtggcggc tggcatcgtc 
???tgctgga gaaccatctc 
tggagtcggc caatggctgg 
gStcatctt attttgctcc 
Scagacaggc tcggatgaag 
tcacatgcta cctgcccagc 
gtgcctgcga tccctctgtc 
acagcatgct ggatcccctg 
acaagctcaa aatctgcagt 
cggaagagat g^caatttcg 
tccaaagcca gtctgatggg 



tctgtccccg 
gtgatgccgc 
ctgtgtggtt 
ttggccgtgg 
agacgtagac 
atgaacaggg 
gtggtccacc 
tgcaccctgt 
tgcgtgcaag 
catgacatca 
ttcaagattg 
aaggcgaccc 
gtgtctgcta 
catggggccc 
gtgtattatt 
ctgaaaccca 
aacctcggtc 
caatgggatc 



ccatgtacaa 
cgctgctcat 
tctgcttcca 
ctgatttcct 
actgggcttt 
ccgggagcat 
cccaccacgc 
gggccctggt 
agacggccgt 
tgttccagct 
tttggagcct 
ggttcatcat 
gactctattt 
tgcacataac 
tttcaagccc 
agcagccagg 
gcaggagttg 
cccacattgt 



cgggtcgtgc 60 
tgtggccttt 120 
catgaagacc 180 
ccttatgatc 240 
tggggacatt 300 
cgtgttcctt 360 
ggtgaacact 420 
catcctggga 480 
ctcctgtgag 540 
ggagttcttt 600 
gaggcggagg 66U 
ggtggtggca 720 
cctctggacg 780 
cctcagcttc 840 
ctcctttccc 900 
acactcaaaa 960 
catcagtgtg 1020 
tgagtggcac 1080 



<210> 12 

<211> l^ 40 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> raise fe ature^ 5CBl 
<223> Incyte ID wo. 



<400> 12 
ggcagtgcac 
ggacgctctc 
caggaggttt 
agaggctgaa 
ccttcccgga 
caggctgtgt 
ggaagatcat 
aaatggaaga 
caggcatagc 
tccgatccct 
tcctctgtgg 
gtgactgtga 
gecaagaate 
tactgtacct 
tagtcaatgc 
gctgcccgct 
ctggaattta 
aaatggtgaa 
catcatcatt 
tcatccacat 
ccatggagtg 



gctcagacgc 
tgcgcgggcc 
gtccccgccc 
gctccacctg 
ctacgaggcc 
tcgtcccccg 
tcataatttc 
agggctgaag 
ccttttgtac 
ggattacgta 
ggatgctggc 
gtcccaggaa 
agaccttcct 
gaacacagag 
tattattgaa 
gttgtaccag 
ctatatgtta 
acccagtatt 
aagcaatgaa 
getcatgeag 
tagcgatgtg 



cccgctcctc 
cteggaggag 
gcgcgccgta 

ggaggggagg 
gccgccgggg 
gcgaccacgg 
ataagaegga 
aeggctgate 
ctgcagttgt 
aaaagaacac 
cccctggctg 
tgtgtcacaa 
gatgagctgc 
ataggtccag 
tegggtaaga 
tggcaccgga 
atgeagcegg 
gattatgtgc 
acagaccggc 
gegtacaagg 
atttggcagc 



ccgccagcgc 
geggeggegg 
ccgcggcgga 
cagaaatgga 
cgctgctcgc 
atgagecegg 
tccagaccaa 
cccatgactg 
accgggtcac 
tteggaatet 
ttggagctgt 
aacttttgea 
tttatggacg 
gcaccgtgtg 
ctttgtcaag 
ageagtaegt 
cagcaaaagt 
gecacaaaaa 
tggtgcactg 
tctttaagga 
gaggtttget 



gcggcctcgc 
ggcgagctgc 
gatgggegag 
ggaacgggcg 
ctccggagcg 
cctccctttt 
aattaaagat 
etctgettat 
atgtgaccaa 
gaatggccgc 
gatttatcac 
gctccagaga 
ggcaggttat 
tgagtcagct 
ggaagaaaga 
tggagcagee 
ggaccaagaa 
attccgatct 
gtgccacggc 
ggagaagtac 
geggaaggge 



tcctcctaga 60 
agegceggga 120 
accatgtcaa 180 
ttcgtcaacc 240 
gecgaagaga 300 
catcaggacg 360 
cttctgcagc 420 
actggctgga 480 
acctacctgc 540 
agggtcacct 600 
aaactcagaa 6t>u 
tcggttgtct 720 
ctgtatgcct 780 
attaaagagg 840 
aaaaeggage 900 
catggcatgg 960 
accttgacag 1020 
gggaattacc 1080 
gccccggggg 
ttgaaagagg 1200 
tacgggatat 1260 
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gccatgggac 
agaagtacct 
ggtgccgcat 
ttctctctga 
cgaagaggga 
cagattgctt 
tcacaggccc 
acagcaccct 



tgctggcaac 
ctaccgagct 
tcctgacaga 
tgtcctggga 
ttaaaaggtg 
agtgcctgac 
ctctggttag 
catcaatata 



ggctattcct 
tgcaagtttg 
ccctattcgc 
ccagagacat 
caaaaagaca 
acagaaacaa 
actagcatga 
aaatatgact 



tcctgtccct 
cagagtggtg 
tctttgaagg 
cacggtttcc 
actaaaatac 
ctgggaatcc 
gtgaccgaag 
tcttcacata 



ttaccgtctc 
tctagattac 
catggctggc 
agcatttgaa 
ccatttggac 
tgaaagagaa 
ccatccatca 
cagaaaaaaa 



acgcaggata 1320 
ggagcacacg 1380 
gctattcact 1440 
cttgactctt 1500 
caaaagccgc 1560 
gcagacaccg 1620 
acattttcta 1680 
aaaaaaaaaa 1740 



<210> 13 
<211> 3002 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 1874944CB1 



<400> 13 

gcccagttct 

gcccgtgctc 

gcacctctac 

ctatgggatg 

ggacagttac 

ctttgtagcc 

cagagtcatc 

caagttgacg 

tggcgtgctt 

ctccctgcag 

cgccttcaag 

cgcgaagccc 

gctcagccct 

agccgggagg 

aaatgcccca 

gccccgagac 

cagcctgatg 

acttcctcgg 

cttcatcact 

caggagggga 

tggttcaggt 

gcatgcggag 

gccgtggtgt 

ttgggggccc 

gagggcgacc 

ctgggccttg 

tgagttttgc 

gcctccttgg 

aggtggaaag 

cacaggcact 

cgtctggtca 

ggtgacagcc 

agatgggata 

atcatagatg 

tagcactcgt 

aactcttcct 

ccttgtttca 

ttcttaatgt 

tgcatgggga 

caggcgaggc 

gtatggcccc 

gttcagtgaa 

gtctcccgtg 

ggtgagcacc 



ggtgctcata 
ttacactact 
agcatggtga 
ggatggggtt 
ggaacaagca 
cctgccctgt 
tcacagatca 
gccaaggcag 
gctgtcaacg 
ggactgttca 
cacaaaacca 
ttccactcgg 
tgggacaaga 
ctgccaacca 
cctttgccca 
agctgtcctc 
cccaggccag 
gggattccca 
cagcatcaga 
tgttcagcct 
ccccacgcac 
tccccagtgt 
gtgtctctgt 
ctgcccaagc 
cctctgccct 
aggctgcctg 
ctctttggac 
ttgttcttgg 
tggagggcat 
ctacgaagtt 
ccatgagacc 
gtgtcttctg 
attgttatgg 
tgaattaaga 
ggtgtctgtg 
ttaaaactgt 
gagtgagttt 
aactcttccc 
gggtaggcag 
ccaggctgca 
tgcaaccgtg 
gagtcccatg 
accccagaaa 
tagaagtgag 



gtttccgctt 
tcttcctgag 
tcaaggtctt 
ttcctcttct 
acaattgctg 
ttgtcatcgt 
gcgccgacaa 
tggccgtgct 
gttgtgctgt 
tattcctctt 
aggtctggtc 
acctcatgaa 
gcagccactc 
ggccaggctg 
tggaccctct 
ccctgtgact 
cgtgggccct 
ggacacagtg 
cccagcgagg 
ctgtgccttg 
cctcagtcag 
ctgaatccac 
aggtggtgcc 
cgagctcgag 
gtccttgcgg 
atggtgcctg 
cccaattcgg 
gccacaggag 
tttccagggc 
tctaatgggc 
gacctgcgct 
tgttgaggga 
acccatgtgt 
caccaccgag 
aaatgtgggt 
gaccatgatt 
tctatggagc 
ctggtcgcct 
ggagcagcat 
ccagccacct 
ctctggcggg 
tttagtatgg 
ccaggtcaca 
aacactgtat 



tgagccgggc 
tgccttcgca 
tgggtcggag 
gatctgcatc 
gctgtcgttg 
ggtcaacatt 
ctacaagatc 
gctgcccatc 
ggttttccag 
tcattgtctc 
gctcacgagc 
tgggacccgg 
tgcccaccgc 
cgctcagaac 
ccttgctgct 
ctggctgtcg 
cctgccttgc 
gcctgactgt 
ccaggacact 
gtggggcttg 
gcgcaggcag 
tgagtggtga 
ggcgtgggcc 
ccgtgggcgg 
gggtcccctc 
tgcttggggg 
ccttaagatg 
ctggccgtgt 
actgctttcc 
agaccacgcg 
gagtccccac 
aatttatgga 
gggcatgatc 
atacgggctg 
aagacattca 
tcattcagcc 
ctgtggccct 
ggagtggacc 
gtctgcaggg 
gccacatggt 
cacacctggc 
actaaagtcc 
tggaccacag 
tcctacaatg 



acgaccccct 
tggatgctgg 
gacagcaagc 
atttcactgt 
gcgagtggcg 
ggcatcctca 
catggagacc 
ctgggtacct 
tacatgtttg 
ctgaattcag 
agctccgccc 
ccaggcatgg 
gtcgacctgt 
acaccccccc 
gtctggacat 
gagcacactg 
atccacccgt 
gatggtgccc 
cggggccggt 
gggactcagg 
ctgggggtgt 
gttccccaca 
aacctgtgct 
gagtcgttga 
tgctcacgtg 
agcttctcgg 
ccctcctccc 
ccccgcagtg 
ccagaggctt 
gcaggtagca 
tgacctggag 
ctcagactca 
ctgtggaaca 
tgaggttcat 
aacctggttt 
cctccacacc 
tttgcagccc 
actcatctgc 
gtgaaccttt 
gacagtgcca 
tgctgcaggc 
catgtttagc 
tgccagatcc 
tacacttgga 



gccaagtgat 
tggagggtct 
accgttacta 
catttgccat 
ccatctgggc 
tcgctgtgac 
ccagtgcctt 
cgtgggtctt 
ccacgctcaa 
aggtgagagc 
gcacctccaa 
cctccaccaa 
cagccgtgtg 
aaacagaatg 
gggtgttgtg 
ctcagcccag 
gggctgagtg 
ttgagcctcc 
cccgcagcac 
gccaaagagg 
gtggggaaga 
gccggcgcta 
gtgtcatcag 
ctctccaggt 
aagagccgct 
ccatccgctg 
tcgtgtgcca 
cctggtgtcc 
cctcatggct 
cagtgcgctc 
agggagggct 
gccccagagg 
caggtttggg 
actgtgctga 
tgatactgga 
cctatgtctg 
acctggtggc 
aggcctctcc 
gctcttctgt 
cgggccctgc 
caaggccgct 
cactgcccca 
tcatcacgcc 
tatttctcct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 
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tatttagttt 
ttgtttttaa 
aaaacctgtc 
ctatgttttg 
attggaaaga 
aaatatgttg 

aa 



ctagtgaaac 
ttgttgccgt 
taaacctttt 
gcttctgcag 
atgaaatgtt 
tgatttttat 



aaatcaagta 
attcatctat 
gtttccaatg 
tacttttatt 
tagtttatag 
gaaataaact 



aggaactatc 
atagctaata 
aatgaaagtc 
atctatacat 
tagaagaaag 
catgtcctga 



tttagtttag 
tttcaagata 
atgcacttta 
aatttggcca 
atgatgacac 
aaaaaaaaaa 



atggaattat 2700 
agtaatgaac 2760 
tttataggct 282U 
aaaataagaa 2880 
taagttgtga 2940 
aaaaaaaaaa 3000 



<210> 14 
<211> 965 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc__feature 
<223> incyte IP No: 



7475270CB1 



<400> 14 
atgggaagat 
cacagccaga 
tgtgggaatg 
tacttcttcc 
aagatggcag 
caaattggct 
tatgaccgct 
gtctgtctcc 
atggtggcag 
gctgtgctcc 
cacctaacag 
cgctaccggg 
atgctgaacc 
gggctggacc 

gCCCCttCtt 

ttcacctgtt 
atgaa 



gggtgaacca 
ctgaccttgt 
tcctcctcat 
tcagccagct 
ccaacttcct 
tttttgtctc 
acgtggccgt 
agattactgg 
ccatgggctt 
gaatacgctc 
ctgtcaccct 
cccctagcca 
ccctcattta 
gctgcaggat 
gcctgtgtca 
gatggtcatt 



gtcctacaca 
cctcttctct 
cttcctcatc 
ctccctcatg 
gtctggcagg 
tcttgtggga 
tagccaccca 
gagctcctgg 
accttactgt 
tgctcaggcc 
cttctatggg 
tgacaaggtg 
cagcttgagg 
tggcagccag 
cattgggaag 
ggatggttta 



gatggcttct 
gcagttatgg 
tacctggacg 
gacctcatgt 
aagtccatct 
tctgaggggc 
cttcactatc 
gcctttggga 
ggctcaagga 
tggaaaaaag 
gcagccatgt 
gcctctatct 

aatggggagg 
cactgaaccc 
tcactcaacc 
taggttttgg 



tcctcttggg 
tggtcttcac 
ctggacttca 
tggtctgtaa 
cctttgtggg 
tcttgctggg 
ccatcctcat 
taatagatgg 
gcgtggatca 
ccctggccac 
tcatgtacct 
tctacacagt 
tgatgggggc 
cagagtctgg 
tttgtgagtg 
ccatcatgag 



catcttttcc 60 
agtggccctc 120 
cacccccatg 180 
cattgtgcca 240 
ctgtggcata 300 
actcatggct 360 
gaatcagagg 420 
agtgattcag 480 
ctttttctgg 540 
ctgctcctcc 600 
gaggcctagg 660 
ccttactccc 72U 
actgaggaag 780 
tgcctgctgt 840 
tctgtttcca 900 
aaaagccatt 960 
965 



<210> 15 
<211> I 617 
<212> BNA 

<213> Homo sapiens 

<22?> » isc r^ U S- 55000189CB1 
<223> mcyte ID no. 



<400> 15 
gttgcctttt 
tgtgttgtgc 
gcttttgcca 
gggccgccag 
ttctaattca 
cccaggatgg 
tccatgcgct 
tcacatattg 
gaagacagca 
atgctcctga 
gctctatccc 
attgctttca 
tttgttttcc 
ctcatcctgt 
gaccgtcacc 
gtgtacacac 
atctccctag 



tfl tctt«flt cggtgttctg tgj£££ 
gttttctgtt gtgtttttgt gttgg 
?tgtttacgg ccaggctttg gttccg g 
ttgtgtggct gj^ctjc ^f fc ? tttt 
gtaagtccaa ctctetcacc ccatgcatat 

catgcaatgg 9 ac °°"J J tg g C actttg 
acaaaacagc acatttatga tgg 
acaattcctt g^ccagaagc ttaacag ^ 
aaaacagaaa agggagaaga tggtg 8^ 
ttacaaagca jagaaattga 
tgttctagaa ttcaagtcat y ga 
gScattgcaa ttaaagaatg cgaagjj^ 
tgggattctc cagattcwc aa a 
acacattaac tgtggctggc aa a 
tccacactcc catgtacttc ttcctg g 
tgttcatcat tccacagatg £ctcc J 
caggttgtac cacccaaacg ttc« 



tttgtttctc 
ctgttctgcc 
tcggattcca 
cagaagctat 
gtctcagctg 
tgtaaaggaa 
aaaagatatg 
cagcagtgcc 
gctatataac 
tagtgttttc 
tcttcacgtt 
acctcacaga 
tcactctctt 
tcatgaccat 
tgctggctag 
tcgtaaccca 
ttaccttggc 



tgttgttgtt 60 
aggttccagc 120 
tttagtaact 180 
ccacctatgg 240 
tgtgggcttt 300 
aatgcctccc 360 
ggctgtggtg 420 
agaagattaa 480 
atgagcgaga 540 
tatagtcatt 600 
gacttcctgc 660 
ggtaacagag 720 
tgtggttttt 780 
catctgcatt 840 
ctcaaagaca 900 
gacccagcca 960 
catcaacaat 1020 



12/13 



WO 01/87937 



PCT/USO 1/1 6285 



tgcttcttgc 
tacagggtca 
ggcctggcca 
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